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DETECTION OF MICROS ATELLITE INSTABILITY AND ITS 
USE IN DIAGNOSIS OF TUMORS 

5 This invention was made using U.S. government Small Business Innovation 

Research Program Grant CA76834-02 from the National Institutes of Health, The 
U.S. government retains certain rights to the invention. 

TECHNICAL FIELD OF THE INVENTION 
10 This invention relates to the detection of instability in regions of genomic 

DNA containing simple tandem repeats, such as microsatellite loci. The invention 
particularly relates to multiplex analysis for the presence or absence of instability in a 
set of microsatellite loci in genomic DNA from cells, tissue, or bodily fluids 
originating from a tumor. The invention also relates to the use of microsatellite 
15 instability analysis in the detection and diagnosis of cancer and predisposition for 
cancer. 

BACKGROUND OF THE INVENTION 
Microsatellite loci of genomic DNA have been analyzed for a wide variety of 
20 applications, including, but not limited to, paternity testing, forensics work, and in the 
detection and diagnosis of cancer. Cancer can be detected or diagnosed based upon 
the presence of instability at particular microsatellite loci that are unstable in one or 
more types of tumor cells. 

A microsatellite locus is a region of genomic DNA with simple tandem 
25 repeats that are repetitive units of one to five base pairs in length. Hundreds of 
thousands of such microsatellite loci are dispersed throughout the human genome. 
Microsatellite loci are classified based on the length of the smallest repetitive unit. 
For example, loci with repetitive units of 1 to 5 base pairs in length are termed 
"mono-nucleotide", "di-nucleotide", "tri-nucleotide", "tetra-nucleotide", and "penta- 
30 nucleotide" repeat loci, respectively. 

Each microsatellite locus of normal genomic DNA for most diploid species, 
such as genomic DNA from mammalian species, consists of two alleles at each locus. 
The two alleles can be the same or different from one another in length and can vary 
from one individual to the next. Microsatellite alleles are normally maintained at 



-2- 



constant length in a given individual and its descendants; but, instability in the length 
of micros atellites has been observed in some tumor types (Aaltonen et al, 1993, 
Science 260:812-815; Thibodeau et al, 1993 Science 260:816-819; Peltomaki et al, 
1993 Cancer Research 53:5853-5855; Ionov et al, 1993 Nature 363:558-561). This 

5 form of genomic instability in tumors, termed microsatellite instability (hereinafter, 
"MSI"), is a molecular hallmark of the inherited cancer syndrome Hereditary 
Nonopolyposis Colorectal Cancer (hereinafter, "HNPCC"). The cause of MSI in 
HNPCC is thought to be a dysfunctional DNA mismatch repair system that fails to 
reverse errors that occur during DNA replication (Fishel et al, 1993 Cell 75:1027-38; 

10 Leach et al, 1993 Cell 75:215-25; Bronner et al, 1994 Nature 368:258-61; 
Nicolaides et al, 1994 Nature 371:75-80; Miyaki et al, 1997 Nat Genetics 17:271-2). 
Insertion or deletion of one or more repetitive units during DNA replication persists 
without mismatch repair and can be detected as length polymorphisms by comparison 
of allele sizes found in microsatellite loci amplified from normal and tumor DNA 

15 samples (Thibodeau et al, 1993, supra). 

MSI has been found in over 90% of HNPCC and in 10-20% of sporadic 
colorectal tumors (Liu et al, 1996 Nature Med 2:169-174; Thibodeau et al, 1993, 
supra\ Ionov et al, 1993 Nature 363:558-561; Aaltonen et al, 1993 Science 260: 
812-816; Lothe et al, 1993 Cancer Res. 53: 5849-5852). However, MSI is not 

20 limited to colorectal tumors. MSI has also been detected in pancreatic cancer (Han et 
al, 1993 Cancer Res 53:5087-5089) gastric cancer (Id.; Peltomaki et al, 1993 Cancer 
Res 53:5853-5855; Mironov et al, 1994 Cancer Res 54:41-44; Rhyu et al, 1994 
Oncogene 9:29-32; Chong et al, 1994 Cancer Res 54:4595-4597), prostate cancer 
(Gao et al, 1994 Oncogene 9:2999-3003), endometrial cancer (Risinger et al, 1993 

25 Cancer Res 53:5100-5103; Peltomaki et al, 1993 Cancer Res 53:5853-5855), and 
breast cancer (Patel et al, 1994 Oncogene 9:3695-3700). 

The genetic basis of HNPCC is thought to be a germ-line mutation in one of 
several DNA mismatch repair genes (hereinafter "MMR") (Leach et al., 1993 Cell 
75:1215-1225; Fishel et al, 1993 Cell 75:1027-38; Leach et al, 1993 Cell 75:215-25; 

30 Bronner et al, 1994 Nature 368:258-61; Nicolaides et al, 1994 Nature 371:75-80; 
Miyaki et al, 1997 Nat Genetics 17:271-2; Papadopoulos et al, 1994 Science 
263:1625-1629) Among HNPCC patients, 50-60% have been reported to carry 
inherited mutations in two mismatch repair genes, MSH2 and MLH1 (Kolodner et al, 
1999 Cancer Research 59:5068:5074). Moreover, 70-100% of HNPCC cases whose 
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tumors manifest a high frequency MSI (hereinafter "MSI-H") phenotype reportedly 
have germ-line mutations in these two genes. Few germ-line mutations in MSH6, 
MSH3, PMS1 and PMS2 genes have been reported in HNPCC patients, indicating 
that inherited mutations in these mismatch repair genes play a minor role in HNPCC 
5 (Peltomaki et aL, 1997 Gastroenterologly 113:1146-1158; Liu et aL, 1996 Nat Med 
2:169-174; Kolodner et aL, 1999 Cancer Research 59:5068-5074). Without 
functional repair proteins, errors that occur during replication are not repaired leading 
to high mutation rates and increased likelihood of tumor development. 

Repetitive DNA is particularly sensitive to errors in replication and therefore 
10 dysfunctional mismatch repair systems result in widespread alterations in 
microsatellite regions. A study of yeast cells without functional mismatch repair 
systems showed a 2800, 284, 52, and 19 fold increase in mutation rates in mono-, di-, 
tri-, tetra-, and penta-nucleotide repeats, respectively (Sia et aL, 1997 Molecular and 
Cellular Biology 17:2851-2858). Mutations in mismatch repair genes are not thought 

15 to play a direct role in tumorigenesis, but rather act by allowing DNA replication 
errors to persist. Mismatch repair deficient cells have high mutation rates and if these 
mutations occur in genes involved in tumorigenesis the result can lead to the 
development of cancer. MSI positive tumors have been found to carry somatic 
frameshift mutations in mono-nucleotide repeats in the coding region of several genes 

20 involved in growth control, apoptosis, and DNA repair (e.g., TGFBRII, BAX, 
IGFIIR, TCF4, MSH3, MSH6) (Planck et aL, 2000 Genes, Chromosomes & Cancer 
29:33-39; Yamamoto et aL, 1998 Cancer Research 58:997-1003; Grady et aL, 1999 
Cancer Research 59:320-324; Markowitz et aL, 1995 Science 268:1336-1338: 
Parsons et aL, 1995 Cancer Research 55:5548-5550). The most commonly altered 

25 locus is TGFBRII, in which over 90% of MSI-H colon tumors have been found to 
contain a mutation in the 10 base polyadenine repeat present in the gene (Markowitz 
et aL, 1995 Science 268:1336-1338). 

MSI occurs in almost all HNPCC tumors regardless of which mismatch repair 
gene is involved. MSI has also been shown to occur early in tumorigenesis. These 

30 two factors contribute to making MSI analysis an excellent diagnostic test for the 
detection of HNPCC. In addition, MSI analysis can serve as a useful pre-screening 
test to identify potential HNPCC patients for further genetic testing. MSI analysis of 
sporadic colorectal carcinomas is also desirable, since the occurrence of MSI 



correlates with a better prognosis (Bertario et al, 1999 International J Cancer 80:83- 
7). 

One long-standing problem with diagnosing HNPCC is that colon tumor 
biopsies from a person with HNPCC look the same pathologically as a sporadic colon 
tumor, making diagnosis of the syndrome difficult. Since prognosis, therapy and 
follow-up will be different for patients with HNPCC, it is important to find more 
definitive diagnostic methods. However, mutation detection in HNPCC patients 
remains difficult because there are at least 5 known MMR genes which are large 
genes without known hot spots for mutations. Direct gene sequencing remains the 
most precise method of mutation detection, but is time consuming and expensive 
(Terdiman et al, 1999 The American Journal of Gastroenterology 94:23544-23560). 
In addition, high sensitivity and specificity can be difficult to obtain with sequencing 
alone because many mutations that are detected may be harmless polymorphismsthat 
have no affect on the function of the mismatch repair proteins. 

DNA analysis of microsatellite loci makes it theoretically possible to develop 
a blood test for use in the detection of specific types of cancer. Early studies have 
shown that tumor DNA is released into the circulation, and is present in particularly 
high concentrations in plasma and serum in a number of different types of cancer 
(Leon etal, 1977 Cancer Res 37:646-650; Stroun et al, 1989 Oncology 46:318-322). 
Since then, DNA released into the blood from several different types of tumors has 
been detected by analysis of microsatellite DNA using the polymerase chain reaction 
(hereinafter, "PCR") (Hibi et al, 1998 Cancer Research 58:1405-1407; Chen et al, 
1999 Clinical Cancer Research 5:2297-2303; Kopreski et al, 1999 Clinical Cancer 
Research 5:1961-1965; Fujiwara et al, 1999 Cancer Research 59:1567-1571; Chen et 
al, 1996 Nature Medicine 2:1033-1034; Goessl et al, 1998 Cancer Research 
58:4728-4732; Miozzo et al, 1996 Cancer Research 56:2285-2288). 

The first tumor-specific gene sequences detected in blood from patients with 
cancer were mutated K-ras genes (Vasioukhin et aL 3 1994 Br. J Haematol 86: 774- 
779; Sorenson et al, 1994 Cancer Epidemiol Biomark. Prev. 3:67-71; Sorenson et 
al, 2000 Clinical Cancer Research 6:2129-2137; Anker et al, 1997 
Gastroenterology 112:1114-1120). More recently, detection of microsatellite 
instability in soluble tumor DNA from plasma and serum originating from head and 
neck squamous cell cancers (Nawroz et al, 1996 Nature Med 2:1035-1037) and small 
cell lung cancers (Chen et al, 1996 Nature Med 2:1033-1035) has been shown. 
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These successes have stimulated searches for microsatellite instability in circulating 
tumor DNA from many other cancer types. Hibi et al, used microsatellite markers to 
search for the presence of genetic alterations in serum DNA from colon cancer 
patients (Hibi, K. et al, 1998 Cancer Research 58:1405-1407). Hibi et al, also 
5 reported that eighty percent of primary tumors in the colon cancer patients displayed 
MSI and/or loss of heterozygosity (hereinafter, "LOH"), another type of mutation 
discussed below. No microsatellite or LOH mutations were detected in paired serum 
DNA. However, identical K-ras mutations were found in corresponding tumor and 
serum DNAs, indicating that tumor DNA was present in the blood. (Id.) 

10 The detection of circulating tumor cells and micrometastases may also have 

important prognostic and therapeutic implications. Because disseminated tumor cells 
are present in very small numbers, they are not easily detected by conventional 
immunocytological tests, which can only detect a single tumor cell among 10,000 to 
100,000 normal cells (Ghoussein et al, 1999 Clinical Cancer Research 5:1950-1960). 

15 More sensitive molecular techniques based on PCR amplification of tumor-specific 
abnormalities in DNA or RNA have greatly facilitated detection of occult (hidden) 
tumor cells. PCR-based tests capable of routinely detecting one tumor cell in one 
million normal cells have been devised for identification of circulating tumor cells 
and micrometastases in leukemias, lymphomas, melanoma, neuroblastoma, and 

20 various types of carcinomas. (Id.) 

Most targets for detection of disseminated tumor cells have been mRNAs. 
However, some DNA targets have been used successfully, including K-ras mutations 
in colon cancers, as noted above. The presence of microsatellite instability in some 
types of tumor cells raises the possibility that these tumor specific mutations created 

25 by the instability could serve as a target for PCR-based detection of occult tumor 
cells. 

There has been considerable controversy about how to precisely define and 
accurately measure MSI (Boland, 1998 Cancer Research 58:5248-5257). Reports on 
the frequency of MSI in various tumors ranges considerably. For example, different 
30 studies have reported ranges of 3% to 95% MSI for the frequency of MSI in bladder 
cancer (Gonzalez-Zulueta et al, 1993 Cancer Research 53:28-30; Mao et al, 1996 
PNAS 91:9871-9875). One problem with defining MSI is that it is both tumor 
specific and locus dependent (Boland et al 1998, supra). Thus, the frequency of MSI 
observed with a particular tumor type in a single study will depend on the number of 



tumors analyzed, the number of loci investigated, how many loci need to be altered to 
score a tumor as having MSI and which particular loci were included in the analysis. 
To help resolve these problems, the National Cancer Institute sponsored a workshop 
on MSI to review and unify the field (Id.). As a result of the workshop a panel of five 
microsatellites was recommended as a reference panel for future research in the field. 
This panel included two mono-nucleotide loci BAT-25, BAT-26, and three 
dinucleotide loci D5S346, D2S123, D17S250. 

One particular problem in MSI analysis of tumor samples occurs when one of 
the normal alleles for a given marker is missing due to LOH, and no other novel 
fragments are present for that marker (Id.). One cannot easily discern whether this 
represents true LOH or MSI in which the shifted allele has co-migrated with the 
remaining wild-type allele. In cases like this, the recommendation from the NCI 
workshop on MSI was not to call it as MSI. One way to minimize this type of 
problem would be to use loci that displayed low frequency of LOH in colon tumors. 

Clinical diagnostic assays used for determining treatment and prognosis of 
disease require that the tests be highly accurate (low false negatives) and specific (low 
false positive rate). Many informative microsatellite loci have been identified and 
recommended for MSI testing (Boland et al 1998, supra). However, even the most 
informative microsatellite loci are not 100% sensitive and 100% specific. To 
compensate for the lack of sensitivity using individual markers, multiple markers can 
be used to increase the power of detection. The increased effort required to analyze 
multiple markers can be offset by multiplexing. Multiplexing allows simultaneous 
amplification and analysis of a set of loci in a single tube and can often reduce the 
total amount of DNA required for complete analysis. To increase the specificity of an 
MSI assay for any given type of cancer, it has been recommended that the panel of 
five highly informative microsatellite loci identified at the National Institute 
Workshop (see above) be modified to substitute or add other loci of equal utility 
(Boland et al 1998, supra, at p. 5250). Increased information yielded from 
amplifying and analyzing greater numbers of loci results in increased confidence and 
accuracy in interpreting test results. 

Multiplex MSI analysis solves problems of accuracy and discrimination of 
MSI phenotypes, but the additional complexity can make analysis more challenging. 
For example, when microsatellite loci are co-amplified and analyzed in a multiplex 
format, factors affecting ease and accuracy of data interpretation become much more 
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essential. One of the primary factors affecting accurate data interpretation is the 
amount of stutter that occurs at microsatellite loci during PCR (Bacher & Schumm, 
1998 Profiles in DNA 2:3-6; Perucho, 1999 Cancer Research 59:249-256). Stutter 
products are minor fragments produced by the PCR process that differ in size from 
5 the major allele by multiples of the core repeat unit. The amount of stutter observed 
in microsatellite loci tends to be inversely correlated with the length of the core repeat 
unit. Thus, stutter is most severely displayed with mono- and di-nucleotide repeat 
loci, and to a lesser degree with tri-, tetra-, and penta-nucleotide repeats (Bacher & 
Schumm, 1998, supra). Use of low stutter loci in multiplexes would greatly reduce 

10 this problem. However, careful selection of loci is still necessary in choosing low 
stutter loci because percent stutter can vary considerably even within a particular 
repeat type (Micka et a/., 1999 Journal of Forensic Sciences 44: 1-15). 

Microsatellite multiplex systems have been primarily developed for use in 
genotyping, mapping studies and DNA typing applications. These multiplex systems 

15 are designed to allow co-amplification of multiple microsatellite loci in a single 
reaction, followed by detection of the size of the resulting amplified alleles. For DNA 
typing analysis, the use of multiple microsatellite loci dramatically increases the 
matching probability over a single locus. Matching probability is a common statistic 
used in DNA typing that defines the number of individuals you would have to survey 

20 before you would find the same DNA pattern as a randomly selected individual. For 
example, a four locus multiplex system (GenePrint™ CTTv Multiplex System, 
Promega) has a matching probability of 1 in 252.4 in African-American populations, 
compared to an eight locus multiplex system (GenePrint™ PowerPlex™ 1.2 System, 
Promega) which has a matching probability of 1 in 2.74 x 10 8 (Proceedings: 

25 American Academy of Forensic Sciences (Feb. 9-14, 1998), Schumm, James W. et ah, 
p. 53, B88; Id, Gibson, Sandra D. et al, } p. 53, B89; Id., Lazaruk, Katherine et al y p. 
51, B83; Sparkes, R. et al, 1996 Int J Legal Med 109:186-194). Other commercially 
available multiplex systems for DNA typing include AmpFZSTR Profiler™ and 
AmpF/STR COfiler™ [AmpFlSTR Profiler™ PCR Amplification Kit User's Manual 

30 (1997), i-viii and 1-1 to 1-10; and AmpFlSTR COfiler™ PCR Amplification Kit User 
Bulletin (1998), i-iii and 1-1 to 1-10, both published by Perkin-Elmer Corp). In 
addition to multiplexes for DNA typing, a few multiplex microsatellite systems have 
been developed for the detection of diseases, such as cancer. One such system has 
been developed by Roche Diagnostics, the "HNPCC Microsatellite Instability Test", 
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in which five MSI loci (BAT25, BAT26, D5S436, D17S250, and D2S123) are co- 
amplified and analyzed. Additional systems are needed, particularly systems that 
include additional loci displaying high sensitivity to MSI and low stutter for easy and 
accuracy of analysis. 

5 The materials and methods of the present invention are designed for use in 

multiplex analysis of particular microsatellite loci of human genomic DNA from 
various sources, including various types of tissue, cells, and bodily fluids. The 
present invention represents a significant improvement over existing technology, 
bringing increased power of discrimination, precision, and throughput to the analysis 
10 of MSI loci and to the diagnosis of illness, such as cancer, related to mutations at such 
loci. 

BRIEF SUMMARY OF THE INVENTION 
The present invention provides methods and kits for amplifying and analyzing 
15 microsatellite loci or sets of microsatellite loci. The present invention also provides 
methods and kits for detecting cancer in an individual by co-amplifying multiple 
microsatellite loci of human genomic DNA originating from tumor tissue or 
cancerous cells. 

In one aspect, the present invention provides a method of analyzing micro- 
20 satellite loci, comprising: (a) providing primers for co-amplifying in a single tube a 
set of at least three microsatellite loci of genomic DNA, comprising at least one 
mono-nucleotide repeat locus and at least two tetra-nucleotide repeat loci; (b) co- 
amplifying the set of at least three microsatellite loci from a sample of genomic DNA 
in a multiplex amplification reaction, using the primers, thereby producing amplified 
25 DNA fragments; and (c) determining the size of the amplified DNA fragments. 

In another aspect, the present invention provides a method of co-amplifying 
the set of at least three microsatellite loci of at least two different samples of genomic 
DNA, a first sample originating from normal non-cancerous biological material from 
an individual and a second sample originating from a second biological material from 
30 the individual. The at least two samples of human genomic DNA are co-amplified in 
separate multiplex amplification reactions, using primers to each of the loci in the set 
of at least three microsatellite loci. The size of the resulting amplified DNA 
fragments from the two multiplex reactions are compared to one another to detect 
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instability in any of the at least three microsatellite loci of the second sample of 
human genomic DNA. 

Another embodiment of the present invention is a method of analyzing at least 
one mono-nucleotide repeat locus of human genomic DNA selected from the group 
5 consisting of MONO- 11 and MONO-15. The method of analyzing the at least one 
mono-nucleotide repeat locus selected from the group consisting of MONO- 11 and 
MONO-15 comprises the steps of: (a) providing at least one primer of the at least one 
mono-nucleotide repeat locus; (b) amplifying the at least one mono-nucleotide repeat 
locus from a sample of genomic DNA originating from a biological material from an 

10 individual human subject, using the at least one primer, thereby producing an 
amplified DNA fragment; and (c) determining the size of the amplified DNA 
fragments. The amplified DNA fragments are preferably analyzed to detect 
microsatellite instability at the at least one mono-nucleotide repeat locus by 
comparing the size of the amplified DNA fragments to the most commonly observed 

15 allele size at that locus in a human population. Alternatively, the method is used to 
amplify the at least one mono-nucleotide repeat locus of a sample of human genomic 
DNA from normal non-cancerous biological material from an individual, and 
microsatellite instability is detected by comparing the resulting amplified DNA 
fragments to those obtained in step (b). 

20 Another embodiment of the present invention is a kit for the detection of 

microsatellite instability in DNA isolated from an individual subject, comprising a 
single container with oligonucleotide primers for co-amplifying a set of at least three 
microsatellite loci comprising one mono-nucleotide locus and two tetra-nucleotide 
loci. 

25 The various embodiments of the method and kit of the present invention, 

described briefly above, are particularly suited for use in the detection of MSI in 
tumor cells or cancerous cells. Specifically, the method or kit of the present invention 
can be used to amplify at least one mono-nucleotide repeat locus selected from the 
group consisting of MONO-11 and MONO-15 or the set of at least three 

30 microsatellite loci comprising at least one mononucleotide repeat locus and at least 
two tetranucleotide repeat loci of at least one sample of genomic DNA from 
biological material, such as tissue or bodily fluids, preferably biological material 
containing or suspected of containing DNA from tumors or cancerous cells. For 
monomorphic or quasi-monomorphic loci, such as MONO-11 and MONO-15, one 
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can compare the resulting pattern to the pattern produced by amplifying normal DNA 
from any individual in a population with a standard pattern at that locus. However, it 
is preferable to use DNA from normal tissue of the same individual from whom the 
tumor DNA was obtained, in order to ensure that a positive result does not reflect a 
5 germline mutation, rather than MSI. 

The method and kit can also be used to compare the results of multiplex 
amplification of DNA from normal tissue of an individual to the results of multiplex 
amplification of DNA from other biological material from the same individual. Use 
of this particular embodiment of the method of the present invention to detect MSI in 

10 tumor cells by comparison to normal cells is illustrated in Figure 1. Specifically, 
Figure 1 shows a tetra-nucleotide repeat (GATA), amplified by a primer pair ("primer 
A" and "primer B") in a polymerase chain reaction ("PCR"), followed by separation 
of amplified alleles by size using capillary electrophoresis, and a plot of the 
fractionated amplified alleles using GeneScan™ software. Note that only the two 

15 alleles and small stutter peaks appear in the plot of amplified DNA from normal 
DNA, while three MSI peaks appear in addition to the two allele peaks in the plot of 
amplified tumor DNA. 

Advantages and a fuller appreciation of the specific attributes of this invention 
will be gained upon an examination of the following figures, detailed description of 

20 preferred embodiments, and appended claims. It is expressly understood that the 
drawings are for the purposes of illustration and description only, and are not intended 
as a definition of the limits of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figure 1. Illustration of microsatellite instability analysis. The figure is a 

diagram of a primer pair annealed to a tetra-nucleotide locus on two alleles of the 
same genomic DNA, and plots of results of capillary electrophoresis of products of 
amplification of a tetra-nucleotide locus of DNA originating from normal vs. tumor 
tissue. MSI peaks are indicated in the plot of amplified DNA from tumor tissue. 
30 Figure 2. Illustration of effect of microsatellite repeat unit length on amount 

of stutter observed. The figure includes a diagram of a primer pair annealed to a 
tetranucleotide repeat locus on two different alleles of genomic DNA, and a set of 
fluorescent scans and plots of amplified mono-, di-, tri-, tetra-, and penta-nucleotide 
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repeat loci from human genomic DNA from various individuals, amplified and 
fractionated by gel or by capillary electrophoresis. 

Figure 3. Demonstration that low stutter tetranucelotide repeat loci are easier 
to interpret than high stutter dinucleotide repeat loci. The figure is a plot of results of 
5 capillary electrophoresis of products of the amplification of two tetra-nucleotide and 
two di-nucleotide repeat loci of two different sets of samples of DNA originating 
from normal vs. tumor tissue. 

Figure 4. Illustration of variance in amount of stutter within selected 
tetranucleotide and pentanucleotide repeat loci. The figure is a plot of the variability 
10 in percent stutter observed in a 13 different tetra-nucleotide and 5 different 
pentanucleotide repeat loci. The boxes represent the average percent stutter and the 
solid bars the range of stutter observed for each locus. 

Figure 5. Results of screening of tetranucleotide repeat markers for frequency 
of microsatellite instability. The figure is a plot of the number of micros atellite loci, 
15 out of a total of 273 markers, that displays a given percent MSL For example, 
approximately 15 loci were altered in 100% of MSI-H tumor samples evaluated. 

Figure 6. Results of screening of pentanucleotide repeat markers for frequency 
of microsatellite instability. The figure is a plot of the percent MSI observed for each 
of eight different tetra-nucleotide repeat loci in a set of nine MSI-H and a set of 30 
20 MSS tumors. 

Figure 7. Microsatellite instability analysis using MONO-15 marker. The 
figure is a plot generated from capillary electrophoresis products of amplification of 
the MONO-15 locus of DNA from four different sets of paired normal and tumor 
samples originating from four different individuals. 
25 Figure 8. Percent MSI in 59 colon cancer samples using nine-locus MSI 

multiplex. The figure is a plot of the percent MSI observed in 59 colon cancer 
samples (29 MSH and 30 MSI-L or MSS samples) using a nine locus MSI multiplex 
(D1S518, D3S2432, D7S1808, D7S3046, D7S9070, D10S1426, BAT-25, BAT-26, 
and MONO-15. 

30 Figure 9. Fluorescent multiplex microsatellite analysis using nine-locus MSI 

Multiplex. The figure is a plot generated from capillary electrophoresis of products of 
multiplex amplification of normal non-cancerous human genomic, using the nine 
locus MSI multiplex used in Figure 8. 
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Figure 10. Detection of microsatellite instability in colon cancer samples 
using nine-locus MSI multiplex. The figure is a plot generated from capillary 
electrophoresis of products of multiplex amplification of DNA from paired normal 
and colon tumor sample, using the nine locus MSI multiplex used in Figure 8. 
5 Figure 11. Detection of microsatellite instability in colon cancer samples using 

nine-locus MSI multiplex is the same type of plot shown in Figure 10, generated 
using a different sample of paired normal and colon cancer DNA from a different 
individual. 

Figure 12. Detection of microsatellite instability in stomach cancer samples 
10 using nine-locus MSI multiplex. The figure is a plot generated from capillary 
electrophoresis of products of multiplex amplification of DNA from paired normal 
and stomach cancer tumor samples, using the nine locus MSI multiplex described in 
Figure 8. 

Figure 13. Microsatellite analysis of paraffin embedded tissues with nine- 
15 locus MSI multiplex. The figure is a plot generated from capillary electrophoresis of 
products of multiplex amplification of DNA from paraffin embedded tissue, using the 
nine locus MSI multiplex described in Figure 8. 



20 DETAILED DESCRIPTION OF THE INVENTION 

A. Definitions 

The following definitions are intended to assist in providing a clear and 
consistent understanding of the scope and detail of the following terms, as used to 
describe and define the present invention: 
25 "Allele", as used herein, refers to one of several alternative forms of a gene or 

DNA sequence at a specific chromosomal location (locus). At each autosomal locus 
an individual possesses two alleles, one inherited from the father and one from the 
mother. 

"Amplify", as used herein, refers to a process whereby multiple copies are 
30 made of one particular locus of a nucleic acid, such as genomic DNA. Amplification 
can be accomplished using any one of a number of known means, including but not 
limited to the polymerase chain reaction (PCR) (Saiki, R. K., et aL, 1985 Science 230: 
1350-1354), transcription based amplification (Kwoh, D. Y., and Kwoh, T. J., 
American Biotechnology Laboratory, October, 1990) and strand displacement 
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amplification (SDA) (Walker, G. T., et al, 1992 Proc. Natl Acad, Scl, U.S.A. 89: 
392-396). 

"Co-amplify", as used herein, refers to a process whereby multiple copies are 
made of two or more loci in the same container, in a single amplification reaction. 
5 "DNA polymorphism", as used herein, refers to the existence of two or more 

alleles for a given locus in the population. "Locus" or "genetic locus", as used herein, 
refers to a unique chromosomal location defining the position of an individual gene or 
DNA sequence. "Locus-specific primer", as used herein, refers to a primer that 
specifically hybridizes with a portion of the stated locus or its complementary strand, 

10 at least for one allele of the locus, and does not hybridize efficiently with other DNA 
sequences under the conditions used in the amplification method. 

"Loss of Heterozygosity" (hereinafter, "LOH"), as used herein, refers to the 
loss of alleles on one chromosome detected by assaying for markers for which an 
individual is constitutionally heterozygous. Specifically, LOH can be observed upon 

15 amplification of two different samples of genomic DNA from a particular subject, one 
sample originating from normal biological material and the other originating from a 
tumor or pre-cancerous tissues. The tumor exhibits LOH if DNA from the normal 
biological material produces amplified alleles of two different lengths and the tumor 
samples produces only one of the two lengths of amplified alleles at the same locus. 

20 "Microsatellite Locus", as used herein, refers to a region of genomic DNA that 

contains short, repetitive sequence elements of one (1) to seven (7), more preferably 
one (1) to five (5), most preferably one (1) to four (5) base pairs in length. Each 
sequence repeated at least once within a microsatellite locus is referred to herein as a 
"repeat unit." Each microsatellite locus preferably includes at least seven repeat units, 

25 more preferably at least ten repeat units, and most preferably at least twenty repeat 
units. 

"Microsatellite Instability" (hereinafter, "MSI"), as used herein, refers to a 
form of genetic instability in which alleles of genomic DNA obtained from certain 
tissue, cells, or bodily fluids of a given subject change in length at a microsatellite 
30 locus. Specifically, MSI can be observed upon amplification of two different samples 
of genomic DNA from a particular subject, such as DNA from healthy and cancerous 
tissue, wherein the normal sample produces amplified alleles of one or two different 
lengths and the tumor sample produces amplified alleles wherein at least one of the 
alleles is of a different length from the amplified alleles of the normal sample of DNA 
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at that locus. MSI generally appears as a result of the insertion or deletion of at least 
one repeat unit at a microsatellite locus. 

"MSI-H", as used herein, is a term used to classify tumors as having a high 
frequency of MSI. When five microsatellite loci are analyzed, such as the five 
5 microsatellite loci of selected by a workshop on HNPCC at the National Cancer 
Institute in 1998 for use in the detection of HNPCC, a tumor is classified as MSI-H 
when at least two of the loci show instability (Boland, 1998 Cancer Research 58: 
5248-5257). When more than five microsatellite loci are analyzed, a tumor is 
classified as MSI-H when at least 30% of the microsatellite loci of genomic DNA 
10 originating from the tumor is are found to be unstable. 

"MSI-L", as used herein, is a term used to classify tumors as having a low 
frequency of MSL When five microsatellite loci are analyzed, such as the five 
microsatellite loci of selected by a workshop on HNPCC at the National Cancer 
Institute in 1998 for use in the detection of HNPCC, a tumor is classified as MSI-L 
15 when only one of the loci shows instability. When more than five microsatellite loci 
are analyzed, a tumor is classified as MSI-L when less than 30% of the microsatellite 
loci of genomic DNA originating from the tumor is are found to be unstable. MSI-L 
tumors are thought to represent a distinct mutator phenotype with potentially different 
molecular etiology than MSI-H tumors (Thibodeau, 1998; Wu et aL, 1999, Am J Hum 
20 Genetics 65:1291-1298). To accurately distinguish MSI-H and MSI-L phenotypes it 
has been recommended that more than five microsatellite markers be analyzed 
(Boland, 1998, supra; ; Frazer et al, 1999 Oncology Research 6:497-505). 

"MSS", as used herein, refers to tumors which are microsatellite stable, when 
no microsatellite loci exhibit instability. The distinction between MSI-L and MSS 
25 can also only be accomplished when a significantly greater number of markers than 
five are utilized. The National Cancer Institute recommended use of an additional 19 
mono- and di-nucleotide repeat loci for this purpose, and for the purpose of making 
clearer distinctions between MSI-H and MSI-L tumors, as described above (Boland, 
1998, supra), 

30 "MSI-L/S", as used herein, refers to all classified as either MSI-L or MSS. 

"Microsatellite marker", as used herein, refers to a fragment of genomic DNA 
which includes a microsatellite repeat and nucleic acid sequences flanking the repeat 
region. 
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"Monomorphic", as used herein, refers to a locus of genomic DNA where only 
one allele pattern has been found to be present in the normal genomic DNA of all 
members of a population. 

"Nucleotide", as used herein, refers to a basic unit of a DNA molecule, which 
5 includes one unit of a phosphatidyl back bone and one of four bases, adenine ("A"); 
thymine ("T"); guanine ("G"); and cytosine ("C"). 

"Polymerase chain reaction" or "PCR", as used herein, refers to a technique in 
which cycles of denaturation, annealing with primer, and extension with DNA 
polymerase are used to amplify the number of copies of a target DNA sequence by 
10 approximately 10 6 times or more. The polymerase chain reaction process for 
amplifying nucleic acid is covered by U. S. Patent Nos. 4,683,195 and 4,683,202, 
which are incorporated herein by reference for a description of the process. 

"Primer", as used herein, refers to a single-stranded oligonucleotide or DNA 
fragment which hybridizes with a strand of a locus of target DNA in such a manner 
15 that the 3' terminus of the primer may act as a site of polymerization using a DNA 
polymerase enzyme. 

"Primer pair", as used herein, refers to a pair of primers which hybridize to 
opposite strands a target DNA molecule, to regions of the target DNA which flank a 
nucleotide sequence to be amplified. 
20 "Primer site", as used herein, refers to the area of the target DNA to which a 

primer hybridizes. 

"Quasi-monomorphic", as used herein, refers to a locus of genomic DNA 
where only one allele pattern has been found to be present in the normal genomic 
DNA of almost all the members of a population 

25 "Stutter", as used herein, refers to a minor fragment observed after 

amplification of a microsatellite locus, one or more repeat unit lengths smaller than 
the predominant fragment or allele. It is believed to result from a DNA polymerase 
slippage event during the amplification process (Levinson & Gutman, 1987 
Molecular Bioliolgy Evolution 4:203; Schlotterer and Tautz, 1992 Nucleic Acids 

30 Research 20:211). 

B. Selection of Loci to be Amplified or Co- Amplified: 

At least one MSI locus amplified or co-amplified in each of the embodiments 
of the present invention illustrated and discussed herein is a mono-nucleotide repeat 
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locus. Such loci have been shown to very susceptible to alteration in tumors with 
dysfunctional DNA mismatch repair systems (Parsons., 1995 supra), making such 
loci particularly useful for the detection of cancer and other diseases associated with 
dysfunctional DNA mismatch repair systems. One group of researchers reported that 
5 by amplifying and analyzing a single mono-nucleotide repeat locus, BAT-26, they 
were able to correctly confirm the MSI-H status of 159 out of 160 (99.4% accuracy) 
tumor samples (Hoang et al } 1997 Cancer Research 57:300-303). 

Some mono-nucleotide repeat loci, including BAT-26, have also been 
identified as having quasi-monomorphic properties. Monomorphic or quasi- 

10 monomorphic properties make the comparison of normal/tumor pairs simpler, since 
PCR products from normal samples are generally all the same size and any alterations 
in tumor samples are easily identified. 

The principal draw-back to using a mono-nucleotide repeat locus in the 
analysis of genomic DNA is that amplification of any such locus results in a large 

15 number of extraneous amplified fragments of DNA of various lengths, the product of 
"stutter" during the amplification reaction. Such artifacts are present to a lesser 
degree in the products of amplifying loci with increasingly longer repeat units. For an 
illustration of the relationship between repeat unit length and the presence of 
extraneous amplified fragments, see Figure 2. Figure 2 shows increased stutter 

20 artifacts with decreasing repeat unit length from penta-nucleotide to mono-nucleotide 
repeat loci. 

When a mono-nucleotide locus is monomorphic or quasi-monomorphic, 
however, one can readily detect shifts in the size of an allele, indicating MSI, even in 
the presence of a high degree of stutter. When a locus is quasi-monomorphic, 

25 detection of shifts in size can be done by comparison of amplified alleles from 
genomic DNA from biological material of an individual, such as tumor tissue or 
bodily fluids, suspected of exhibiting microsatellite instability to the most commonly 
observed allele size at that locus in a population. This feature enables one to use a 
single standard or panel of standard allele patterns to analyze individual results, 

30 minimizing the amount of samples which must be taken from an individual in order to 
detect microsatellite instability in certain genomic DNA of the individual. 

At least one of the microsatellite loci amplified in the method or using the kit 
of the present invention is preferably a mono-nucleotide repeat locus, more preferably 
a quasi-monomorphic mono-nucleotide repeat locus. The mono-nucleotide repeat 
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locus selected for use in the methods and kits of the present invention is preferably 
unstable in cancerous biological material, but not in normal biological material. 
BAT-25 and BAT-26 have been identified as mono-nucleotide repeat loci useful in 
the identification of MSI in colorectal tumors characteristic of Hereditary 
5 Nonpolyposis Colon Cancer (Zhou et al. y 1998 Genes, Chromosomes & Cancer 
21:101-107; Dietmaier et al 9 1997 Cancer Research 57:4749-4756; Hoang et aL, 
1997 Cancer Research 57:300-303). Two additional loci, identified herein as 
MONO- 11 and MONO- 15 were identified through a search of a public computerized 
database of sequence information (GenBank), and found to have the preferred 

10 characteristics for such loci, identified above. The search for and identification of 
mono-nucleotide repeat loci suitable for use in the present invention is illustrated in 
Example 2. Similar techniques could be used to identify other mono-nucleotide 
repeat loci suitable for use in the methods and kits of the present invention. 

The mono-nucleotide repeat loci amplified or co-amplified according to the 

15 present methods or using the present kits are preferably quasi-monomorphic and 
exhibit instability in the type of tissue of interest for a given application. MONO-11 
and MONO- 15, have been be particularly useful in the methods and kits of the present 
invention. Both loci are quasi-monomorphic and exhibit instability in several 
cancerous tumor tissues. At least one, more preferably at least two mono-nucleotide 

20 repeat microsatellite loci are amplified or co-amplified in the method of the present 
invention. 

At least one mono-nucleotide repeat locus and at least two tetra-nucleotide 
repeat loci are co-amplified and analyzed according to at least some embodiments of 
the method and kits of the present invention. Tetra-nucleotide repeat loci inherently 

25 generate very few stutter artifacts when amplified, compared to microsatellite loci 
with shorter repeat units, particularly compared to mono- and di-nucleotide repeat 
loci. {See, e.g., Figure 2.) Such artifacts can be difficult to distinguish from MSI if a 
shifted allele occurs at the stutter position of the second allele. Therefore, concerns 
about interpretation, and the need for quasi-monomorphism in order to make data 

30 interpretation possible is not present, as it is for mono-nucleotide repeat loci. In fact, 
one can even use tetra-nucleotide repeat loci which are highly polymorphic in a 
population, provided it is stable within an individual subject. Such loci are commonly 
used in DNA typing. 
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As with any locus to be amplified in any method or using any kit of the 
present invention, the tetra-nucleotide repeat loci are preferably selected on the basis 
of being stable in the DNA of an individual except in the type of biological material 
of interest. Preferred tetra-nucleotide repeat loci used in the methods and kits of the 
5 present invention include: FGA, D1S518, D1S547, D1S1677, D2S1790, D3S2432, 
D5S818, D5S2849, D6S1053, D7S3046, D7S1808, D7S3070, D8S1179, D9S2169, 
D10S1426, D10S2470, D12S391, D17S1294, D17S1299, and D18S5L 

Additional mono-nucleotide or tetra-nucleotide loci with the same preferred 
criteria described above are preferably co-amplified with the set of at least three 

10 microsatellite loci described above. However, it is contemplated that microsatellite 
loci other than mono-nucleotide repeat or tetra-nucleotide repeat loci could be 
included in the set of at least three microsatellite loci co-amplified and analyzed 
according to the method or using the kit of the present invention. 

Preferred methods for selection of loci and sets of loci amplified and analyzed 

15 according to the methods or using the kits of the present invention are discussed 
further, herein below. However, once the method and materials of this invention are 
disclosed, additional methods of selecting loci, primer pairs, and amplification 
techniques for use in the method and kit of this invention are likely to be suggested to 
one skilled in the art. All such methods are intended to be within the scope of the 

20 appended claims. 

C. Additional Screening of Loci 

When the method or kit of the present invention is to be used in clinical 
diagnostic assays to be used to determine treatment and prognosis of disease, it must 

25 be designed to produce results which are highly accurate (low false negatives) and 
specific (low false positive rate). Informative microsatellite loci are preferably 
identified by screening, more preferably by very extensive screening (see Examples 1 
and 2). However, even the most informative microsatellite loci are not 100% 
sensitive and 100% specific. 

30 The power of individual markers at detecting the presence of MSI in tissue 

associated with a particular disease, such as cancerous tumors, can be increased 
tremendously by multiplexing multiple markers. Increased information yielded from 
amplifying and analyzing greater numbers of loci results in increased confidence and 
accuracy in interpreting test results. To obtain needed sensitivity in detecting or 
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diagnosing diseases such as cancer, it has been recommended that one analyze five or 
more highly informative microsatellite loci (Boland, 1998 Cancer Research 58: 5248- 
5257). Multiplexing of microsatellite loci further simplifies MSI analysis by allowing 
simultaneous amplification and analysis of all multiple loci, while reducing the 
5 amount of often-limited DNA required for amplification. 

Another common problem in MSI determination relates to the occurrence of 
an intermediate MSI phenotype where only a small percentage (<30%) of 
microsatellite markers are altered in tumors (Boland, 1998, supra). These MSI-low 
tumors are thought to represent a distinct mutator phenotype with potentially different 

10 molecular etiology than MSI-H tumors (Thibodeau et aL, 1993 Science 260: 816-8; 
Wu et aL, 1999 Am J Hum Genetics 65:1291-1298; Kolodner et aL, 1999 Cancer 
Research 59:5068-5074; Wijnen et aL, 1999 Nature Genetics 23:142-144 ). It is not 
clear however if there is a real difference between MSI-L and MSS tumors. For 
purposes of diagnosis, MSI-L and MSS tumors are generally considered as one stable 

15 phenotypic class. To accurately distinguish MSI-H and MSI-L phenotypes it has been 
recommended that multiple microsatellite markers be analyzed (Boland, 1998; Frazer, 
1999 supra). 

It is contemplated that when the loci are to be co-amplified and analyzed in a 
multiplex amplification reaction, additional factors are taken into account, including 

20 ease and accuracy of interpretation of data. One of the primary factors affecting 
accurate data interpretation is the amount of stutter that occurs at microsatellite loci 
during PCR. Tetra-nucleotide repeat loci were chosen for inclusion in the MSI 
multiplex analyzed according to the method and using the kit of the present invention 
because they display considerably less stutter that shorter repeat types like di- 

25 nucleotides (Figure 2). However, careful selection of loci is still necessary in 
choosing low stutter loci because % stutter can vary considerably even within a 
particular repeat type (Figure 4). Mono-nucleotide repeat loci were chosen for 
individual analysis and for inclusion in the MSI Multiplex because of high rates of 
instability in diseased biological material of interest. 

30 Incidence of LOH is another factor in the selection of MSI loci to be amplified 

and analyzed in the methods or kits of the present invention. LOH can result in 
misidentification of a missing normal allele at a microsatellite marker as an indication 
of MSI when no other novel fragments are present for that marker. Specifically, one 
cannot easily discern whether this represents true LOH or MSI in which the shifted 
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allele has co-migrated with the remaining wild-type allele. In order to minimize the 
problem described above, the microsatellite markers selected for use in the present 
methods and kits preferably exhibit a low frequency of LOH, preferably no more than 
about 20% LOH, more preferably no more than about 14% LOH, even more 
5 preferably, no more than about 3% LOH. 

It is a relatively uncommon occurrence for a microsatellite market to possess 
all necessary attribute described above (i.e., high sensitivity, high specificity, low 
stutter, low LOH). The threshold for an MSI analysis system to be used in a 
diagnostic test is even higher, requiring robust and reproducible results from multiple 

10 loci in one assay using small quantities of DNA from difficult samples and be able to 
distinguish between MSI-L and MSI-H phenotypes. All the specific preferred mono- 
and tetra-nucleotide repeat loci identified herein above as being preferred for use in 
the present invention were found to meet each of the criteria for MSI loci suitable for 
use in diagnostic analysis, set forth herein above. 

15 Additional loci selection criteria particular to the two principal types of MSI 

loci amplified in the preferred multiplex analysis methods and using the kits of the 
present loci are described below. 

D. Design of Primers 

20 Primers for one or more microsatellite loci are provided in each embodiment 

of the method and kit of the present invention. At least one primer is provided for 
each locus, more preferably at least two primers for each locus, with at least two 
primers being in the form of a primer pair which flanks the locus. When the primers 
are to be used in a multiplex amplification reaction it is preferable to select primers 

25 and amplification conditions which generate amplified alleles from multiple co- 
amplified loci which do not overlap in size or, if they do overlap in size, are labeled in 
a way which enables one to differentiate between the overlapping alleles. 

Primers suitable for the amplification of individual loci preferably co- 
amplified according to the methods of the present invention are provided in Example 

30 4, Table 9, herein below. Primers suitable for use in a preferred multiplex of nine loci 
(i.e., BAT-25, D10S1426, D3S2432, BAT-26, D7S3046, D7S3070, MONO-15, 
D1S518, and D7S1808) are described in Example 6, Table 11. Guidance for 
designing this and other multiplexes is provided, below. It is contemplated that other 
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primers suitable for amplifying the same loci or other sets of loci falling within the 
scope of the present invention could be determined by one of ordinary skill in the art. 

E. Design and Testing of MSI Multiplex 

5 The method of multiplex analysis of microsatellite loci of the present 

invention contemplates selecting an appropriate set of loci, primers, and amplification 
protocols to generate amplified alleles from multiple co-amplified loci which 
preferably do not overlap in size or, more preferably, which are labeled in a way 
which enables one to differentiate between the alleles from different loci which 
10 overlap in size. Combinations of loci may be rejected for either of the above two 
reasons, or because, in combination, one or more of the loci do not produce adequate 
product yield, or fragments which do not represent authentic alleles are produced in 
this reaction. 

The following factors are preferably taken into consideration in deciding upon 

15 which loci to include in a multiplex of the present invention. To effectively design 
the microsatellite multiplex, size ranges for alleles at each locus are determined. This 
information is used to facilitate separation of alleles between all the different loci, 
since any overlap could result in an allele from one locus being inappropriately 
identified as instability at another locus. 

20 The amount of stutter exhibited by non-mononucleotide repeat loci is also 

preferably taken into consideration; as the amount of stutter exhibited by a locus can 
be a major factor in the ease and accuracy of interpretation of data. It is preferable to 
conduct a population study to determine the level of stutter present for each non- 
mono-nucleotide repeat locus. As noted above, tetra-nucleotide repeat markers 

25 display considerably less stutter that shorter repeat types like di-nucleotides and 
therefore can be accurately scored in MSI assays (Figures 2 and 3)(Bacher & 
Schumm, 1998 Profiles in DNA 2(2):3-6). Note that even within a class of 
microsatellite loci, such as tetra- and penta-nucleotide repeat loci, known to exhibit 
low stutter, the percent stutter can vary considerably within the repeat type (Figure 3; 

30 see also Figure 2) (Micka et aL, 1999, supra). 

Although at least one mono-nucleotide and at least two tetra-nucleotide repeat 
loci are included in the multiplex of MSI loci co-amplified according to the method or 
using the kit of the present invention, additional mono-nucleotide and/or tetra- 
nucleotide repeat loci can be included in the multiplex. It is also contemplated that 
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multisatellite loci other than mono- or tetra-nucleotide repeat loci meeting the same or 
similar criteria to the criteria described above would be included in the multiplex. 

The multiplex analyzed according to the present invention preferably includes 
a set of at least three MSI loci. It more preferably includes a set of at least five MSI 
loci, even more preferably a set of at least nine MSI loci. When the multiplex is a set 
of at least nine loci, it is most preferably a set of the following loci: BAT-25, 
D10S1426, D3S2432, BAT-26, D7S3046, D7S3070, MONO-15, D1S518, and 
D7S1808. A list of primers suitable for use in this multiplex is provided in Table 11 
of Example 6 below. 

It is also contemplated that other factors, such as successful combinations of 
materials and methods, are taken into consideration in designing a multiplex of MSI 
loci. Determination of such additional factors can be determined by following the 
selection methods and guidelines disclosed herein, and by using techniques known to 
one of ordinary skill in the art of the present invention. Specifically, the same or 
substantially similar techniques can be used to identify the preferred MSI loci and sets 
of MSI loci described herein below to select primer pair sequences, and to adjust 
primer concentrations to identify an equilibrium in which all included loci may be 
amplified. In other words, once the method and materials of this invention are 
disclosed, various methods of selecting loci, primer pairs, and amplification 
techniques for use in the method and kit of this invention are likely to be suggested to 
one skilled in the art. All such methods are intended to be within the scope of the 
present claims. 

F. Sources of Genomic DNA 

The genomic DNA amplified or co-amplified according to the methods of the 
present invention originates from biological material from an individual subject, 
preferably a mammal, more preferably from a dog, cat, horse, sheep, mouse, rat, 
rabbit, monkey, or human, even more preferably from a human or a mouse, and most 
preferably from a human being. The biological material can be any tissue, cells, or 
biological fluid from the subject which contains genomic DNA. The biological 
material is preferably selected from the group consisting of tumor tissue, disseminated 
cells, feces, blood cells, blood plasma, serum, lymph nodes, urine, and other bodily 
fluids. 
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The biological material can be in the form of tissue samples fixed in formalin 
and embedded in paraffin (hereinafter "PET"). Tissue samples from biopsies are 
commonly stored in PET for long term preservation. Formalin creates cross-linkages 
within the tissue sample which can be difficult to break, sometimes resulting in low 
5 DNA yields. Another problem associated with formalin-fixed paraffin-embedded 
samples is amplification of longer fragments is often problematic. When DNA from 
such samples is used in multiplex amplification reactions, a significant decrease in 
peak heights is seen with increasing fragment size. The microsatellite analysis 
method and kit of the present invention are preferably designed to amplify and 

10 analyze DNA from PET tissue samples. (See Example 7 for an illustration of 
amplification of such samples using a method of the present invention.) 

When the method or kit of the present invention is used in the analysis or 
detection of tumors, at least one sample of genomic DNA analyzed originates from a 
tumor. When a monomorphic or quasi-monomorphic locus, such as MONO- 11 or 

15 MONO- 15 is amplified, the size of the resulting amplified alleles can be compared to 
the most commonly observed allele size at that locus in the general population. The 
present method and kit is preferably used to diagnose or detect tumors by co- 
amplifying at least two different samples of DNA from the same individual, wherein 
one of the two samples originates from normal non-cancerous biological material. 

20 The present invention is further explained by the following examples which 

should not be construed by way of limiting the scope of the present invention. 

Example 1: Screening microsatellite markers for frequency of MSI 

25 In this example, microsatellite markers in DNA isolated from tumors were 

compared to microsatellite markers in DNA isolated from normal tissue or cells in 
order to detect MSI. Specifically, microsatellite loci were amplified from paired 
normal/tumor DNA samples and genotyped. If one or more different alleles were 
present in the tumor DNA sample that were not found in normal sample from the 

30 same individual, then it was scored as MSI positive. Di-nucleotide, tetra-nucleotide 
and penta-nucleotide repeat microsatellite markers were analyzed for frequency of 
alteration to determine the relative sensitivity of particular markers to MSL Detailed 
information about the specific procedures used in this example are provided herein, 
below. 
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Tissues and DNA isolation. Matched normal (blood) and neoplastic tissue 
samples for 39 patients were obtained from the Cooperative Human Tissue Network 
(hereinafter, "CHTN")(Ohio State University, Columbus, OH). After surgical 
resection, tissue samples were frozen in liquid nitrogen and stored at -70°C. Blood 
5 samples were collected by venipuncture using vacuum tubes. DNA extraction from 
blood and solid tissues was performed either by standard Phenol/chloroform method 
(Sambrook et al., 1989 Molecular Cloning: A Laboratory Manual Cold Springs 
Harbor Press, Cold Springs Harbor, New York) or with QIAamp Blood and Tissue 
Kit (QIAGEN, Santa Clarita, CA) following manufactures protocol. 

10 PCR and Micro satellite Analysis, Fluorescently labeled primers from 275 

microsatellite loci were used to amplify template DNA from normal/tumor pairs of 
samples. Two hundred and forty-five tetra-nucleotide repeat markers from the 
Research Genetics CHLC/Weber Human Screening Set Version 9.0 were evaluated 
(Research Genetics, Huntsville, AL). Additional primer sets for tetra-nucleotide and 

15 penta-nucleotide repeat markers were obtained from Promega Corporation (Madison, 
WI) (PowerPlex™ 16 System contains D3S1358, TH01, D21S11, D18S51, Penta E, 
D5S818, D13S317, D7S820, D16S539, CSF1PO, Penta D, vWA, D8S1179, TPOX, 
and FGA loci). Penta-nucleotide repeat markers TP53, Penta A, Penta B, Penta C, 
Penta D, Penta E, Penta F and Penta G or were custom synthesized (Promega 

20 Corporation, Madison, WI) using sequence data from public databases Di-nucleotide 
markers included for comparison purposes (D8S254, NM23, D18S35, D5S346, TP53- 
di, D2S123, D1S2883, D3S1611, D7S501) were obtained from PE Biosystems (now 
doing business as Applied Biosystems Group, Foster City, CA). 

Markers from Research Genetics, Human Screening Set Version 9.0, were 

25 multiplexed and screened for MSI using 2.5 ng of DNA in lOjxl PCR reactions 
described below. Other loci were evaluated as monoplexes using lng DNA in 25\x\ 
PCR reactions as described below. All markers were PCR amplified under the same 
conditions in using a Perkin-Elmer Gene Amp PCR System 9600 Thermal Cycler, 
except as indicated otherwise below. Micros attelite markers from the PowerPlex™ 

30 16 System (Technical Manual #TMD012, Promega Corporation, Madison, WI) and 
dinucleotide repeat markers from the Microsatellite RER Assay system (see product 
literature from PE Biosystems, non Applied Biosystems, Foster City, CA) were 
analyzed following manufacture's protocol. 
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TABLE 1 - lOjxl triplex PCR reaction for Research Genetics markers 



PCR Master Mix Component Volume Per 

Sample 

Nuclease Free Water 3.30)0.1 

10X GoldST*R Buffer (Promega) 1 .00^1 

Primer 1 0.50ul 

Primer 2 0.50^1 

Primer 3 0.50ul 

Primer4 0.50ul 

Primer 5 0.50^1 

Primer 6 0 50[ll 

AmpliTaq Gold DNA Polymerase (5Units/ul) (Perkin Elmer) 0.15|Lil 

DNA (1 ng/^1) 2 .50ul 

Total Reaction Volume _____ lO.OOu.1 

5 TABLE 2 - 25^1 PCR reaction 



PCR Master Mix Component Volume Per 

Sample 

Nuclease Free Water \1 A5\i\ 

GoldST*R 10X Buffer (Promega) 2.50^1 

10X Primer Pair Mix (lO^iM) 2.50jxl 

AmpliTaq Gold DNA Polymerase (5Units/nl) (Perkin Elmer) 0.05|nl 

Template DNA (0.4 ng/fil) 2.50jll1 

Total Reaction Volume 25.00ul 



TABLE 3 - Cycling profile for PE 9600 Thermal Cycler 

10 



1 cycle 


95°C for 11 minutes 


1 cycle 


96°C for 1 minute 


10 cycles 


94°C for 30 seconds 




ramp 68 seconds to 56°C, hold for 30 seconds 




ramp 50 seconds to 70°C, hold for 45 seconds 


20 cycles 


94°C for 30 seconds 




ramp 60 seconds to 56°C, hold for 30 seconds 




ramp 50 seconds to 70°C, hold for 45 seconds 


1 cycle 


60°C for 30 minutes 


1 cycle 


Soak 4°C 



One microliter of PCR product (Research Genetics markers were first diluted 
1:4 in IX GoldST*R PCR buffer) was combined with 1 jul of Internal Lane Standard 
15 (Promega Corporation, Madison, WI) and 24 \i\ deionized formamide. Samples were 
denatured by heating at 95°C for 3 minutes and immediately chilled on ice for 3 
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minutes. Separation and detection of amplified fragments was performed on an ABI 

® 

PRISM 310 Genetic Analyzer following the standard protocol recommended in the 
User's Manual with the following settings: 5 second Injection Time, 15 kV Injection 
Voltage, 15 kV Run Voltage, 60°C Run Temperature, and 28 minute Run Time. 
5 Assay Interpretation. Identification of normal and tumor allele sizes was 

accomplished by examining the appropriate electropherogram from the ABI PRISM 
310 Genetic Analyzer (Applied Biosystems) and determining the predominate peaks 
for each locus. One or two peaks or alleles can be present for each locus in normal 
samples depending upon whether individual is homozygous or heterozygous for a 
10 particular marker. Allelic patterns or genotypes for normal and tumor pairs were 
compared and scored as MSI positive if one or more different alleles were present in 
the tumor DNA samples that were not found in normal sample from the same 
individual. 

A wide range in frequency of alteration was observed in between samples and 

15 between markers evaluated. Samples were divided into two groups based on the 
frequency of alteration using guidelines recommended in NCI Workshop on MSI 
(Boland et al, 1998). Samples with greater that 30-40% of markers exhibiting 
alteration in tumor samples were classified as MSI-H and <30-40% as MSI-L. 
Samples with no alterations were classified as microsatellite stable (MSS). Based on 

20 this definition of MSI phenotypes, nine samples were classified as MSI-H and the 
remaining 30 as either MSI-L or MSS. 

The tetra- and penta-nucleotide repeat loci exhibited the smallest amount of 
stutter of the loci screened, above. See Figure 4 for a plot of the % stutter results 
observed at the tetra- and penta-nucleotide repeat loci. The tetra-nucleotide repeat 

25 markers also varied in frequency of alteration, ranging from 0 to 100% MSI in the 
MSI-H group (Figure 5). Penta-nucleotide markers, in general, displayed low levels 
of MSI (Figure 6). Microsatellite markers showing high sensitivity to MSI (>88% 
MSI with MSI-H samples) and high specificity (<8% MSI with MSI-L and MSS 
samples) with the CHTN samples were selected for further evaluation with 20 

30 additional normal/tumor colon cancer samples from Mayo Clinic (Rochester, MN) 
(see Example 5). 
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Example 2: Identification and characterization of mono-nucleotide repeat loci. 

Due to the highly informative nature of mono-nucleotide repeat loci in 
determining MSI phenotype, we also investigated poly (A) regions of the human 
genome as a new source of markers for MSI assays. To accomplish this, mono- 
nucleotide repeats were identified from GenBank (http://www.ncbi.nlm.nih.gov/) 
using BLASTN (Altschul, et al 1990 J. Mol Biol 215:402-410) searches for 
(A) 30 (N)3o sequences. The (N) 30 sequence was added to eliminate frequent mRNA 
hits and to assure that flanking sequence was available for designing primers for PCR. 
Next, flanking primers were designed for 33 GenBank DNA sequences using Oligo 
Primer Analysis Software version 6.0 (National Biosciences, Inc., Plymouth, MN) to 
amplify the region containing the poly (A) repeat. Evaluation of loci was performed 
using 9 MSI-H and 30 MSS colon cancer samples and corresponding normal DNA 
samples. Protocols for PCR, detection and analysis are described in Example 1. 

Two characteristics were screened for in the new loci. First, loci were 
screened for which could detect MSI in the MSI-H group and not in the MSS group. 
Secondly, loci were selected on the basis of being monomorphic or nearly 
monomorphic (quasi-monomorphic). The monomorphic nature of the new loci was 
determined by genotyping 96 samples from 5 racial groups (African-American, 
Asian- American, Caucasian-American, Hispanic-American, Indian-American). 
Screening of 33 mono-nucleotide repeat loci revealed four new mono-nucleotide 
repeat loci (MON0-3, MONO-11, MONO-15, and MONO-19) that displayed high 
sensitivity to MSI (Table 4 and Figure 7) and were relatively homozygous and 
monomorphic (Table 5). The degree of homozygosity and mono morphism detected 
at each such locus is shown on Table 6. 



TABLE 4 - Results from Screening of Mono-nucleotide Repeat Loci 



MSI 
Type 


BAT-25 


BAT-26 


MONO-3 


MONO- 
11 


MONO- 
15 


MONO- 
19 


MSI-H 


100% 


100% 


100% 


100% 


100% 


100% 


MSI-L or 
MSS 


0% 


0% 


0% 


0% 


0% 


0% 
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TABLE 5 - Polymorphism Level of Mono-nucleotide Repeat Loci 





BAT-25 


BAT-26 


MONO-11 


MONO-15 


% Homozygosity 


95% (82/86) 


95% (89/94) 


89% (76/85) 


99% (87/88) 


% Monomorphic 


95% 


95% 


89% 


99% 



5 Example 3: Population studies 

A population study was conducted in which 93 samples from African- 
American individuals were genotyped using preferred microsatellite loci selected as 
candidates for multiplexing in the studies illustrated in Examples 1 and 2, above. See 

10 Table 6, below, and Table 3, above, for the amplification conditions used. See Table 
7, below, for a list of the loci amplified and analyzed in this study. In addition, a 
pooled Human Diversity DNA sample and control CEPH DNAs 1331-1 and 1331-2 
(Coriell Cell Repository, Camden, NJ) were included in the screening population. 
African-American samples were used because they contain the greatest genetic 

15 diversity found in all racial groups. 

To facilitate screening of 96 samples with 22 different microsatellite markers, 
selected markers were multiplexed in small groups of three. Multiplexed primer sets 
were used to amplify individual sample DNAs using conditions described below. 

20 

TABLE 6 - 25(xl PCR reaction 



PCR Master Mix Component Volume Per 

Sample 

Nuclease Free Water 17.30jLil 
GoldST*R 10X Buffer (Promega) 2.50|Lil 
10X Triplex Primer Mix (1 to 10)iM each) 2.50jxl 
AmpliTaq Gold DNA Polymerase (SUnits/jil) (Perkin Elmer) 0.20|Lil 
Template DNA (0.4 ng/fil) 2.50^1 
Total Reaction Volume 25.00|al 



The results of the population study are summarized in Table 7. The size of the 
25 smallest and largest allele for each locus was identified to determine allele size range. 
To calculate percent stutter, the peak height of the stutter band was divided by the 
peak height generated by the true allele, then multiplied by 100. Minimum and 
maximum stutter values were calculated for each locus as well as the combined 
average percent stutter from 20 random samples. 
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TABLE 7 - Summary of Results of Population Study 





GenRank ID # 


Allele Size Range 








vrc 11 JO JK 


JrOp k5lUQy 


% Stutter 


BAT-25 


U63834 


1 8 hn 


49 hn 
h-z up 


ND 


BAT-26 


U41210 


1 8 hn 


1 9 hn 
iz up 


ND 


MONO- 11 


AC007684 


KTTi 
rsXJ 


1 A Kta 

14 Dp 


1 > JU' 


MONO- 15 


AC007684 

i vv — \J\J 1 vJ 


INJJ 


o bp 




D 1 S547 


G07898 


40 Dp 


If* K« 
ZD Dp 


A Q 

4.I7 


D1S518 


0078*54 


A Q Kr\ 

4o Dp 


"NTT^ 
JNJJ 




D1 S 1 677 




AC\ Kn 

4U Dp 


jj bp 


Q 7 


D2S1790 


f!081Q0 


DO Dp 


44 bp 


7 & 


jlv j o zh- j z 


no8940 


0/ Dp 


40 bp 


s o 

o.U 


DSS81 8 


On 844n" 


36 bp 


JND 


LNJU 


D5S9849 


fr1 S7S9 
*J 1 J / JZ 


4U bp 


3/ bp 


J.J 




\J\JO J JU 


45 bp 


36 bp 


f Q 


JLV / O 1 OVJO 




jo bp 


A A 1 

44 bp 


7 fx 
1 .0 


D79^0zt6 


gi o^^ 


48 bp 


71 bp 


1 0 o 


D7 < s^07n 


VJZ / J4U 


44 bp 


A A U.« 

44 bp 


in o 

1U. J 


1 70 

ju'oo 1 1 / y 


VjrUo / IU 


A A 1 

44bp 


ND 


NTT* 




OUo /4o 


iz Dp 


JNJJ 


JND 


D10S677 




28 bp 


40 bp 


< c 

J.J 


D10S1426 


G08812 


28 bp 


ND 


ND 


D10S2470 


G10285 


48 bp 


29 bp 


5.9 


D12S391 


G08921 


52 bp 


48 bp 


7.6 


D17S1294 


G07967 


44 bp 


28 bp 


7.2 


D17S1299 


G07952 


40bp 


ND 


ND 


D18S51 


LI 8333 


76bp 


ND 


ND 


FGA 


M64982 


120bp 


ND 


ND 



Example 4: MSI Multiplex Design 

In order to develop a multiplex MSI assay system which is highly sensitive to 
MSI, with minimal stutter, and minimal incidence of LOH, the criteria listed in Table 
8, below, were used to screen loci identified in the Examples above as possible 
10 candidates for use in MSI analysis: 



TABLE 8 - MSI Loci Specifications for Use in Multiplex 



Monoplex specifications 




Tetra-nucleotides 


>70% MSI in MSI-H samples 




<8% MSI with MSI-L and MSS samples 




LOH <14% in MSI-H samples 




Average % Stutter <13% 


Mono-nucleotides 


100% MSI in MSI-H samples 




0% MSI with MSI-L and MSS samples 
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Multiplex specifications 


y loci, J mono- ana o teira-nucieoiiaes 




All amplicons <250bp 




T?r*Hnet Qmr\1ifir , Qtir\n nf T^KFA frnm P"FT Q^mnlpQ 
ivUUUaL d-llipilllL/aLlUll Ul -L/INjTY IIVJ111 JT J_* X oalll]Jit/o 




Robust amplification of 1 to 2 ng DNA 




Balanced peak heights between all loci in multiplex 




Sensitivity > 99.9% 




Specificity >99.9% 



The loci listed in Table 9, below, were identified as loci meeting the specifications 
listed in Table 8, above. 



TABLE 9 - Preferred Microsatellite Loci for Multiplexing 



Locus 


Tvnp 


vjcIlJDclIltV 

No. 


jt i unci 

SEO ID 


% MST 
(MSI-H) 


% LOH 
(MSI-H) 


% MSI 
(MSS or 
MSI-L) 




A/Tnnn 


U63834 


1 2 


100% 


0% 


0% 


RAT 0f\ 

DA 1 jL\j 


A/Ton n 


U41210 


3 4 


100% 


0% 


0% 


MONO-1 1 


M!ono 


AC007684 


5, 6 


100% 


0% 


0% 


MONO-1 S 


A/Ton n 


AC007684 


7 8 


100% 


0% 


0% 


D1S518 


Tetra 


G07854 


9, 10 


83% 


0% 


0% 


D1S547 


Tetra 


G07828 


11, 12 


78% 


3% 


0% 


D1S1677 


Tetra 


G09926 


13, 14 


80% 


0% 


3% 


D2S1790 


Tetra 


G08190 


15, 16 


82% 


3% 


3% 


D3S2432 


Tetra 


G08240 


17, 18 


83% 


3% 


3% 


D5S818 


Tetra 


G08446 


19,20 


72% 


14% 


3% 


D5S2849 


Tetra 


G15752 


21,22 


76% 


7% 


0% 


D6S1053 


Tetra 


G08556 


23,24 


76% 


0% 


0% 


D7S1808 


Tetra 


G08643 


25, 26 


90% 


0% 


0% 


D7S3046 


Tetra 


G10353 


27,28 


93% 


0% 


0% 


D7S3070 


Tetra 


G27340 


29, 30 


86% 


3% 


3% 


D8S1179 


Tetra 


G08710 


31, 32 


75% 


7% 


7% 


D9S2169 


Tetra 


G08748 


33, 34 


72% 


3% 


0% 


D10S1426 


Tetra 


G08812 


35, 36 


86% 


3% 


0% 


D10S2470 


Tetra 


G10285 


37, 38 


83% 


3% 


0% 


D12S391 


Tetra 


G08921 


39,40 


79% 


3% 


0% 


D17S1294 


Tetra 


G07967 


41,42 


86% 


3% 


0% 


D17S1299 


Tetra 


G07952 


43,44 


79% 


3% 


0% 


D18S51 


Tetra 


L18333 


45,46 


75% 


7% 


0% 


FGA 


Tetra 


M64982 


47,48 


82% 


7% 


7% 



* MSI-H samples: N=29 and MSI-US samples N=30. 



Example 5: Analysis of Mismatch Repair Genes 

In order to determine the underlying cause of MSI in MSI-H tumor samples 
used in developing the Multiplex MSI Assay, protein expression levels for MLH1 and 
MSH2 genes were examined. Immunohistochemical analysis of paraffin-embedded 
tissues from eight MSI-H samples was performed as described in Thibodeau et al 
{Cancer Research 58, 1713-1718). Lack of protein expression in MLH1 and MSH2 
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genes is expected in tumor samples exhibiting high levels of MSI and is an indication 
of dysfunctional mismatch repair system. 

The results of the immunohistochemical assays on the MSI-H tumor samples 
is shown in Table 10. 

TABLE 10 - Protein Expression of MSH1 And MSH2 in MSI-H Cancer Samples 



Tumor 
Sample 


Source 


MSI 
Phenotype 

*J XT 


Protein expression 


TTA/fT 141 


JtllVloxlZ 


C172 


CHTN 


MSI-H 




i 


C404 


CHTN 


MSI-H 




i 

~r 


C507 


CHTN 


MSI-H 




i 

T 


C546 


CHTN 


MSI-H 




+ 


C624 


CHTN 


MSI-H 


NTT* 




C710 


CHTN 


MSI-H 






C1166 


CHTN 


MSI-H 




+ 


C5412 


CHTN 


MSI-H 




+ 


S 15945 


CHTN 


MSI-H 


+ 


+ 


A-l 


Mavo Clinic 


MSI-H 




+ 


A-5 


Mayo Clinic 


MSI-H 




+ 


A-7 


Mavo Clinic 


MSI-H 


+ 




A-15 


A4avo Clinic 


MSI-H 




+ 


A-19 


Mavo Clinic 


MSI-H 




+ 


A-29 


A/Tavn Clinir* 


MST-H 

J.VX»JX XX 




+ 


A-49 


A/Tpivn Clinir* 


MST-H 

IVXkJX x± 


+ 




A-50 


A/Tavn Clinir* 


MSI-H 




+ 


A-73 


TVTavo Clinic 


MSI-H 




+ 


A- 102 


TVfavo Clinic* 

J.TJLC4- V V,/ 


MSI-H 


+ 




B-2 


Mavo Clinic 

1TJLU V \~r \ '11 Ml V 


MSI-H 




+ 


B-52 


Mavo Clinic 

1TJ.U. y \_/ v-'iiniv 


MSI-H 




+ 


B-61 

XJ \J -L 


IVT^ivn Clinir* 


MST-H 




+ 


B-75 


A/T^ivn Clinir* 


MSI-H 




+ 


B-76 


A/Iavo Clinic 


MSI-H 




+ 


B-93 




IVlvJl - ! J. 




+ 


B-107 

LJ X\J / 


IVT^ivn Clinip 


MST-H 




I 

T 


B-155 


Mayo Clinic 


MSI-H 






B-164 


Mayo Clinic 


MSI-H 




+ 


B-166 


Mayo Clinic 


MSI-H 




+ 


B-173 


Mayo Clinic 


MSI-H 




+ 


B-199 


Mayo Clinic 


MSI-H 




+ 


B-209 


Mayo Clinic 


MSI-H 




+ 


B-210 


Mayo Clinic 


MSI-H 




+ 


B-268 


Mayo Clinic 


MSI-H 




+ 


B-299 


Mayo Clinic 


MSI-H 




+ 


B-334 


Mayo Clinic 


MSI-H 




+ 


B-379 


Mayo Clinic 


MSI-H 




+ 


B-402 


Mayo Clinic 


MSI-H 




+ 


B-564 


Mayo Clinic 


MSI-H 




+ 
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Example 6: MSI Multiplex Assay Development and Validation 

Once the best loci were selected for use in designing multiplexes to be 

5 analyzed according to the methods of the present invention, problems associated with 
multiplex PCR and incompatibility between loci needed to be overcome. This 
required careful primer design and extensive trial and error to find loci that were 
capable of simultaneous amplification using a single set of PCR conditions. Problems 
encountered included: (1) primer-primer interactions that occurred when large 

10 number of oligos were combined in a single PCR reaction, (2) primer design 
limitations due to sequence constraints at a particular locus (e.g., minimum size of 
amplicon allowed by DNA sequence, sub-optimal %GC of primers, difficulty 
balancing Tm's for all primers under uniform PCR conditions, difficulty in finding 
primers with desirable thermal profiles to minimize non-specific amplification, 

15 hairpin formation and self dimerization of primers, homology to other repeat 
sequences in human genome), and (3) multiplex design allowing separation of all 9 
loci within limited size range of 250 bp. 

Based on extensive evaluation of close to 300 microsatellite markers described 
in Examples 1 through 5, nine loci were selected for the preferred MSI Multiplex 

20 Assay (Table 11). Three loci are monoplex repeats (BAT-25, BAT-26 and MONO- 
15) and six were tetra-nucleotide repeats (D7S3046, D10S1426, D10S2470, 
D7S3070, D17S1294, D7S1808). These loci represent the best known set of loci 
known for determining MSI in tumor samples. Results of MSI analysis on 29 MSI-H 
and 30 MSI-L or MSS colon cancer samples using the preferred nine-locus multiplex 

25 are summarized in Figure 8. 

A typical example of MSI Multiplex is shown in Figure 9. The image was 
generated by simultaneous amplifying all nine selected microsatellite loci followed by 
separation of PCR products on an ABI 310 CE. Separation of all nine microsatellite 
loci in a single capillary (or gel lane) was accomplished by designing the multiplex so 

30 that loci would not overlap in size or through use of different fluorescent dyes. The 
size ranges for the different multiplex loci were determined by genotyping 93 samples 
from African-American individuals using MSI Multiplex described following 
protocol described below. In addition, a pooled Human Diversity DNA sample and 
control CEPH DNAs 1331-1 and 1331-2 (Cornell Cell Repository) were included in 
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the screening population. African-American samples were used because they contain 
the greatest amount of genetic diversity found in all racial groups. 

TABLE 1 1 - MSI Multiplex Assay Loci and Primers 



Locus 


GenBank 


Repe 


Dye 


Size 


Primer 1 


Primer 2 




ID No. 


at 




Range 


(SEQ. ID.) 


(SEQ. ID.) 






Type 










BAT-25 


U63834 


Mono 


TMR 


118-127 


1 


60 


D10S1426 


G08812 


Tetra 


TMR 


152-173 


57 


58 


D3S2432 


G08240 


Tetra 


TMR 


198-234 


17 


59 


BAT-26 


U41210 


Mono 


FL 


103-116 


61 


62 


D7S3046 


G10353 


Tetra 


FL 


122-163 


55 


56 


D7S3070 


G27340 


Tetra 


FL 


186-249 


53 


54 


MONO-15 


AC007684 


Mono 


JOE 


115-117 


7 


8 


D1S518 


G07854 


Tetra 


JOE 


136-178 


49 


50 


D7S1808 


G08643 


Tetra 


JOE 


190-218 


51 


52 



Protocol for MSI Multiplex Assay, Template DNA from normal and tumor 
tissues obtained from same individual were purified using QIAamp Blood and Tissue 
10 Kit (QIAGEN, Santa Clarita, CA) following manufactures protocol. Two nanograms 
of template DNA in a 25 |il reaction volume was PCR amplified using protocol 
detailed in Table 12, below, using the cycling profile described in Table 3, above. 



15 



TABLE 12 - Amplification Mix for MSI Multiplex Assay 

PCR Master Mix Component Volume Per Sample 

Nuclease Free Water 17.00|xl 

GoldST*R 10X Buffer (Promega) 2.50^1 

Primer Pair Mix (10|LiM) 2.50|il 

AmpliTaq Gold DNA Polymerase (Perkin Elmer) 0.50|il 

Template DNA ( 0.8ng/jal) 2.50^1 

Total Reaction Volume 25.00ixl 



One microliter of PCR product was combined with 1 JlxI of Internal Lane 
Standard (Promega Corporation, Madison, WI) and 24 jLil deionized formamide. 
20 Samples were denatured by heating at 95°C for 3 minutes and immediately chilled on 
ice for 3 minutes. Separation and detection of amplified fragments was performed on 
an ABI PRISM 310 Genetic Analyzer following the standard protocol recommended 
in the User's Manual with the following settings: 
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Run Module: GS STR POP4 (Filter set A) 

Injection Time: 4 seconds 

Injection Voltage: 15 kV 

Run Voltage: 15 kV 

5 Run Temperature: 60°C 

Run Time: 24 minutes 

Identification of normal and tumor allele amplicon sizes was accomplished by 
examining the appropriated electropherogram from the ABI PRISM 310 Genetic 

10 Analyzer and determining the predominate peaks for each locus. One or two peaks or 
alleles were present for each locus in normal samples depending upon whether 
individual was homozygous or heterozygous for a particular marker. Allelic patterns 
or genotypes for normal and tumor pairs were compared and scored as MSI positive if 
one or more different alleles were present in the tumor DNA samples that were not 

15 found in normal sample from the same individual. Typical examples of results 
obtained using multiplex designed for MSI analysis are shown in Figures 10 and 11 
for colon cancer and Figure 12 for stomach cancer. 



Example 7: Amplification of DNA from PET Samples using MSI Multiplex 

20 Microsatellite loci selected for the preferred multiplex were evaluated for their 

ability to amplify DNA from formalin-fixed paraffin-embedded samples. DNA was 
extracted from three 10 micron sections cut from PET blocks using QIAamp Tissue 
Kit (Qiagen, Santa Clarita, CA) according to the manufacture's instructions with the 
following modifications. One hundred microliters of QIAGEN AE buffer preheated 

25 to 70°C was added to column, incubated for 5 minutes, centrifuged, then reapplied to 
column for second elution. Two microliters (out of lOOpl) of purified DNA solution 
was used as template for PCR reactions. The nine locus multiplexed primer set 
described in Example 6 was used to amplify DNA from PET samples. The results 
indicate that the MSI Multiplex is capable of amplifying DNA from difficult and 

30 commonly used PET samples (Figure 13). 

While the present invention has now been described and exemplified with 
some specificity, those skilled in the art will appreciate the various modifications, 
including variations, additions, and omissions that may be made in what has been 
described. Accordingly, it is intended that these modifications also be encompassed 

35 by the present invention and that the scope of the present invention be limited solely 
by the broadest interpretation that lawfully can be accorded the appended claims. 
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CLAMS 



We claim: 
1. 

5 a) 



b) 



10 



c) 



A method of analyzing micro-satellite loci, comprising steps of: 
providing primers for co-amplifying a set of at least three 
microsatellite loci of genomic DNA, comprising at least one mono- 
nucleotide repeat locus and at least two tetra-nucleotide repeat loci; 
co-amplifying the set of at least three microsatellite loci from at least 
one sample of genomic DNA in a multiplex amplification reaction, 
using the primers, thereby producing amplified DNA fragments; and 
determining the size of the amplified DNA fragments. 



15 



20 



2. The method of claim 1, wherein the genomic DNA is human genomic 



DNA. 



3. The method of claim 2, wherein the at least two tetra-nucleotide repeat 
loci are selected from the group consisting of FGA, D1S518, D1S547, D1S1677, 
D2S1790, D3S2432, D5S818, D5S2849, D6S1053, D7S3046, D7S1808, D7S3070, 
D8S1179, D9S2169, D10S1426, D10S2470, D12S391, D17S1294, D17S1299, and 
D18S51. 



4. The method of claim 2, wherein the at least one mono-nucleotide 
repeat locus is selected from the group consisting of BAT-25, BAT-26, MONO-11, 
and MONO-15. 

25 

5. The method of claim 2, wherein the set of at least three microsatellite 
loci is a set of at least five microsatellite loci, comprising: 

at least two mono-nucleotide repeat loci selected from the group 
consisting of BAT-25, BAT-26, MONO-11, and MONO-15; and 
30 at least three tetra-nucleotide repeat loci selected from the group 

consisting of FGA, D1S518, D1S547, D1S1677, D2S1790, D3S2432, 
D5S818, D5S2849, D6S1053, D7S3046, D7S1808, D7S3070, D8S1179, 
D9S2169, D10S1426, D10S2470, D12S391, D17S1294, D17S1299, and 
D18S51. 
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6. The method of claim 2, wherein at least one of the primers provided in 
step (a) has a nucleic acid sequence selected from the group of primer sequences 
identified by: SEQ ID NO: 1- 62. 

7. The method of claim 2, wherein the set of at least three microsatellite 
loci is a set of at least nine microsatellite loci, comprising BAT-25, BAT-26, 
MONO-15, D1S518, D3S2432, D7S1808, D7S3070, D7S3046, D10S1426. 

8. The method of claim 7, wherein the set of at least nine microsatellite 
loci is co-amplified using at least one primer for each locus selected from the group 
consisting of: 

SEQ ID NO: 1 and SEQ ID NO: 60 when the locus is BAT-25, 
SEQ ID NO: 61 and SEQ ID NO: 62 when the locus is BAT-26, 
SEQ ID NO: 7 and SEQ ID NO: 8 when the locus is MONO- 15, 
SEQ ID NO: 49 and SEQ ID NO: 50 when the locus is D1S518, 
SEQ ID NO: 17 and SEQ ID NO: 59 when the locus is D3S2432, 
SEQ ID NO: 51 and SEQ ID NO: 52 when the locus is D7S1808, 
SEQ ID NO: 53 and SEQ ID NO: 54 when the locus is D7S3070, 
SEQ ID NO: 55 and SEQ ID NO: 56 when the locus is D7S3046, and 
SEQ ID NO: 57 and SEQ ID NO: 58 when the locus is D10S1426. 

9. The method of claim 1, wherein the set of at least three microsatellite 
loci is co-amplified in step (c) using at least one oligonucleotide primer for each 
locus which is fluorescently labeled. 

10. The method of claim 1, wherein the at least one sample of genomic 
DNA comprises a first sample of genomic DNA originating from normal non- 
cancerous biological material from an individual and a second sample of genomic 
DNA originating from a tumor of the individual, the method further comprising: 

detecting microsatellite instability by comparing the size of the 
amplified DNA fragments produced from co-amplifying the first sample of 
genomic DNA to the size of the amplified DNA fragments produced from co- 
amplifying the second sample of genomic DNA. 
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11. The method of claim 10, wherein the microsatellite instability results 
are used in prognostic tumor diagnosis. 

12. The method of claim 10, wherein the microsatellite instability results 
are used in the diagnosis of familial tumor predisposition. 

13. The method of claim 10, wherein the microsatellite instability results 
are used to detect cancerous tumors of the gastro-intestinal system and of the 
endometrium. 

14. The method of claim 13 wherein the cancerous tumors are tumors from 
a colorectal cancer. 

15. A method of detecting microsatellite instability in genomic DNA, 

comprising the steps of: 

a) providing primers for co-amplifying a set of at least three 
microsatellite loci of genomic DNA, comprising at least one mono- 
nucleotide repeat locus and at least two tetra-nucleotide repeat loci; 

b) co-amplifying the set of at least three microsatellite loci from a first 
sample of genomic DNA originating from normal non-cancerous 
biological material from an individual and a from a second sample of 
genomic DNA originating from a second biological material from the 
individual, in separate multiplex amplification reactions, using the 
primers, thereby producing first amplified DNA fragments from the 
first sample and second amplified DNA fragments from the second 
sample; and 

c) comparing the size of first amplified DNA fragments to the size of the 
second amplified DNA fragments to detect instability in any of the at 
least three microsatellite loci of the second genomic DNA. 

16. The method of claim 15, wherein the primers provided in step (a) are 
primers to microsatellite loci of human genomic DNA. 
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17. The method of claim 16, wherein the two tetra-nucleotide repeat loci 
are selected from the group consisting of FGA, D1S518, D1S547, D1S1677, 
D2S1790, D3S2432, D5S818, D5S2849, D6S1053, D7S3046, D7S1808, D7S3070, 
D8S1179, D9S2169, D10S1426, D10S2470, D12S391, D17S1294, D17S1299, and 

5 D18S51. 

18. The method of claim 16, wherein the mono-nucleotide repeat locus is 
selected from the group consisting of BAT-25, BAT-26, MONO-11, and MONO-15. 

10 19. The method of claim 16, wherein the set of at least three microsatellite 

loci is a set of at least five microsatellite loci, comprising two mono-nucleotide 
repeat loci selected from the group consisting of BAT-25, BAT-26, MONO-11, and 
MONO-15, and three tetra-nucleotide repeat loci selected from the group consisting 
of FGA, D1S518, D1S547, D1S1677, D2S1790, D3S2432, D5S818, D5S2849, 

15 D6S1053, D7S3046, D7S1808, D7S3070, D8S1179, D9S2169, D10S1426, 
D10S2470, D12S391, D17S1294, D17S1299, and D18S5L 

20. The method of claim 16, wherein at least one of the primers provided 
in step (a) has a nucleic acid sequence selected from the group of primer sequences 

20 identified by: SEQ ID NO: 1- 62. 

21. The method of claim 15, wherein the at least one primer for each locus 
provided in step (a) is fluorescently labeled. 

25 22. The method of claim 16, wherein the set of at least two microsatellite 

loci is a set of at least nine microsatellite loci, comprising: BAT-25, BAT-26, 
MONO-15, D1S518, D3S2432, D7S1808, D7S3070, D7S3046, D10S1426. 

23. The method of claim 22, wherein the set of at least nine microsatellite 
30 loci is co-amplified using at least one primer for each locus selected from the group 
consisting of: 

SEQ ID NO: 1 and SEQ ID NO: 60 when the locus is BAT-25, 
SEQ ID NO: 61 and SEQ ID NO: 62 when the locus is BAT-26, 
SEQ ID NO: 7 and SEQ ID NO: 8 when the locus is MONO-15, 
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SEQ ID NO: 49 and SEQ ID NO: 50 when the locus is D1S518, 
SEQ ID NO: 17 and SEQ ID NO: 59 when the locus is D3S2432, 
SEQ ID NO: 51 and SEQ ED NO: 52 when the locus is D7S1808, 
SEQ ID NO: 53 and SEQ ID NO: 54 when the locus is D7S3070, 
5 SEQ ID NO: 55 and SEQ ID NO: 56 when the locus is D7S3046, and 

SEQ ID NO: 57 and SEQ ID NO: 58 when the locus is D10S1426. 

24. The method of claim 15, wherein the second sample of biological 
material is selected from the group consisting of: tumor tissue, disseminated cells, 

10 feces, blood cells, blood plasma, serum, lymph nodes, urine, and other bodily fluids. 

25. The method of claim 15, wherein the microsatellite instability results 
are used in prognostic tumor diagnosis. 

15 26. The method of claim 15, wherein the microsatellite instability results 

are used in the diagnosis of familial tumor predisposition. 

27. The method of claim 15, wherein the microsatellite instability results 
are used to detect cancerous tumors of the gastro-intestinal system and of the 

20 endometrium. 

28. The method of claim 27 wherein the cancerous tumors are tumors from 
a colorectal cancer. 

25 29. A method of analyzing at least one mono-nucleotide repeat locus, 

comprising the steps of: 

a) providing at least one primer for at least one mono-nucleotide repeat 
locus of human genomic DNA selected from the group consisting of 
MONO- 11 and MONO- 15; 
30 b) amplifying at least one mono-nucleotide repeat locus from a sample of 

genomic DNA originating from a biological material from an 
individual, using the at least one primer, thereby producing amplified 
DNA fragments; and 
c) determining the size of the amplified DNA fragments. 
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30. The method of claim 29, wherein the at least one mono-nucleotide 
repeat locus is co-amplified using at least one primer selected from the group 
consisting of: SEQ ID NO: 5-8. 

5 

31. The method of claim 29, wherein the at least one mono-nucleotide 
repeat locus is amplified using at least one oligonucleotide primer which is 
fluorescently labeled. 

10 32. The method of claim 29, wherein the biological material is selected 

from the group consisting of: tumor tissue, disseminated cells, feces, blood cells, 
blood plasma, serum, lymph nodes, urine, and other bodily fluids. 

33. The method of claim 29, further comprising detecting microsatellite 
15 instability at the at least one mono-nucleotide repeat locus by comparing the size of 

the amplified DNA fragments to the most commonly observed allele size at that 
locus in a human population. 

34. The method of claim 29, further comprising amplifying the at least one 
20 mono-nucleotide repeat locus of a sample of human genomic DNA from normal non- 
cancerous biological material from the individual, and comparing resulting second 
amplified DNA fragments to the amplified DNA fragments obtained in step (b) to 
detect microsatellite instability in step (c). 

25 35. A kit for analyzing microsatellite loci of human genomic DNA, 

comprising: 

a single container with oligonucleotide primers for co-amplifying a set 
of at least three microsatellite loci of human genomic DNA, the set comprising 
one mono-nucleotide repeat locus and two tetra-nucleotide repeat loci. 

30 

36. The kit of claim 35, wherein the two tetra-nucleotide repeat loci are 
selected from the group consisting of FGA, D1S518, D1S547, D1S1677, D2S1790, 
D3S2432, D5S818, D5S2849, D6S1053, D7S3046, D7S1808, D7S3070, D8S1179, 
D9S2169, D10S1426, D10S2470, D12S391, D17S1294, D17S1299, and D18S51. 
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37. The kit of claim 35, wherein the mono-nucleotide repeat locus is 
selected from the group consisting of BAT-25, BAT-26, MONO-11, and MONO-15. 

5 38. The kit of claim 35, wherein the set of at least three microsatellite loci 

is a set of at least five microsatellite loci, comprising two mono-nucleotide repeat 
loci selected from the group consisting of BAT-25, BAT-26, MONO-11, and 
MONO-15, and three tetra-nucleotide repeat loci selected from the group consisting 
of FGA, D1S518, D1S547, D1S1677, D2S1790, D3S2432, D5S818, D5S2849, 
10 D6S1053, D7S3046, D7S1808, D7S3070, D8S1179, D9S2169, D10S1426, 
D10S2470, D12S391, D17S1 294, D 1 7S 1 299, and D 1 8S 5 1 . 



39. The kit of claim 35, wherein the set of at least two microsatellite loci is 
a set of at least nine microsatellite loci, comprising: BAT-25, BAT-26, MONO-15, 
15 D1S518, D3S2432, D7S1808, D7S3070, D7S3046, D10S1426. 



40. The kit of claim 35, wherein at least one of the oligonucleotide 
primers is selected from the group consisting of: 

SEQ ID NO: 1 and SEQ ID NO: 60 when the locus is BAT-25, 

20 SEQ ID NO: 61 and SEQ ID NO: 62 when the locus is BAT-26, 

SEQ ID NO: 7 and SEQ ID NO: 8 when the locus is MONO-15, 
SEQ ID NO: 49 and SEQ ID NO: 50 when the locus is D1S518, 
SEQ ID NO: 17 and SEQ ED NO: 59 when the locus is D3S2432, 
SEQ ID NO: 51 and SEQ ID NO: 52 when the locus is D7S1808, 

25 SEQ ID NO: 53 and SEQ ID NO: 54 when the locus is D7S3070, 

SEQ ID NO: 55 and SEQ ID NO: 56 when the locus is D7S3046, and 
SEQ ID NO: 57 and SEQ ID NO: 58 when the locus is D10S1426. 



41. The kit of claim 35, wherein at least one oligonucleotide primer for 
30 each of the at least three microsatellite loci in the set is fluorescently labeled. 

42. The kit of claim 35, wherein the kit further comprises a thermostable 
polymerase. 
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43. The kit of claim 35, wherein the kit further comprises first control 
DNA isolated from normal non-cancerous biological material, and second control 
DNA from biological material lacking mismatch repair genes. 

44. The kit of claim 35, wherein instability in the set of microsatellite loci 
co-amplified by the primers can be used in prognostic tumor diagnosis. 

45. The kit of claim 35, wherein instability in the set of microsatellite loci 
co-amplified by the primers can be used in the diagnosis of familial tumor 
predisposition. 

46. The kit of claim 35, wherein instability in the set of microsatellite loci 
co-amplified by the primers is an indication of tumors of the gastrointestinal system 
or of the endometrium. 

47. The kit of claim 44, wherein the microsatellite instability is an 
indication of colorectal cancer. 
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ABSTRACT 



Methods and kits are disclosed for use in the analysis of microsatellite 
instability in genomic DNA. Methods and kits are also disclosed which can be used 
to detect microsatellite instability DNA present in biological materials, such as 
tumors. The methods and kits of the present invention can be used to detect or 
diagnose diseases associated with microsatellite instability, such as certain types of 
cancer. 



0) 

o 
o 



c 

CO 

c 

CL 



o 
o 

0) 

o 

C 

03 

O 
I- 



o 
o 

O 
£ 



0) 
JO 
+■» 

o 
o 

3 
C 



<D 
O 

o 

3 
C 

o 
c 
o 



CO 


1 1 
I I 




1 1 


■-C' 


1 1 


LO 


J I 




11 


CO 


1 1 


CM 


11 




1 1 


oc- 


1 1 


r~-> 


1 1 
1 1 


u=- 


1 1 


ir> 


1 1 


■^3" 






1 1 
1 1 


C\l 


| 1 




1 J 


CO 


J 1 


r-- 


ill* 

ir 


-iD 


V l 


TT 


I'll 




ITlt 


C--J 


I 1 




VJ1 


CO 


113 


r-- 


III 




IT 11 
■ I I 1 


IT> 


J i ft | 




ft 1 
HI 


CO 


It; 


Cvl 


' hi 




i l 


'.O 


•nmn- 

l 4 i * ++++ f 


■^r 


T »«*Wf ***** 


CO 




OJ 









as 

CO 



cS 

CO 




Low Stutter Tetra-nucleotide 



□ S GeneScan w Pmject-6/4/£ODis..- Mi 



ISO 200 210 220 230 240 



74SO 
6S00 
6120 
5440 
4760 
40S0 
3400 
2720 
2040 
13SO 

eso 
o 



74SO 
G30O 

6iao 

5440 
47SO 
40SO 
340O 
2720. 
2040. 
1380. 
6SO 
O 





Normal #1a 


h 


\ 



LUll 



2G :52N / TRIPLEX 3 



Tumor #1 a 



X : 230 



MS 4G : 52T / TRIPLEX 3 
Y: 1942 | ' ' 



IE 



High Stutter Di-nucleotide 




P3 



GerceScan™ Project- 6/4/00 Display- 3 1 



1800 
1650 
1500 
1350, 
1200 
1050. 
900. 
750. 
600 
4S0. 
300, 
150, 
0 



4S00 
3750 
3000 
2250 
1500 
750 
0 



Normal #2a 



Hf[U 



IB ZN / TRIPLEX 3 



Tumor #2a 



[MEB 3B • 2T / TRIPLEX 3 



□ mm GeneScan^'Ptxyect-Z/ia^OODispla... nEI 

m 



3600 
3240 
2SS0 
2520 
2160 
1800 
1440 
1030 
720 
360 
0. 



3600 
3240 
2S30 
2520 
2160 
1S00 
1440 
1080 
720 
360 
0 



Normal #2b 



□ H 4G : 2N / Mayo samples, Roche kit 



Tumor #2b 



3G :2T / Mayo samples, Roche kit 



FIG. 3 



18 



16 



14 



12 



10 































































— 1 


1 — I 


1 1 


S I 


I 

1 


I 














1 

I 


I 1 


1 1 


1 

1 


1 
























I 

1 


i— 1 
























i ■ 


1 1 


1 

1 


i— 1 


1 ■ 

















CO 

c 

CD 
CL 



< 

CO 
-I— ' 

c 

CD 



O 

CO 

-4—* 

c 

CD 



LU 

CO 

c 
0 

CL 



o 

X 



CD 
TO 
c 
CD 
CL 



o 

CL 



o 

CN 
00 
CO 

Q 



O 
Q_ 

LL 
CO 

o 



CO 

CO 
CO 



CO 
LO 
CO 
CO 



CO 

00 

Q 



< 

Q LL 



CO 

CO 
CO 
LO 

Q 
Q 



CO 
LO 
CO 

CO 

CO 

Q 



LO 

CO 

CO 



CO 

CN 
Q 



FIG. 4 




% Microsatellite Instability 



FIG. 5 




TP53-penta Penta A Penta B Penta C Penta D Penta E Penta F Penta G 



FIG. 6 



0 



GeneScatf Project-6/4/00 Display- liiiiffl i 



30 



120 



3000. 
£000^ 
1000J 
0" 

1500. 

iooo. 

500_ 
0. 

3000. 

zooo. 

1000. 
0. 

1500. 
1000. 
500. 

0 



A 



HB SamDie A: Normal 



, W i 



3B Samole A. Tumor 



ML 



IB 



Samole B. Normal 



X:93 



SamDie B Tumor 

Y: 105 I \ Mm immf^M§9S0mm * I ► 



□ GeneScan™ Project-6/4/0G Display-4 m 3 E 



30 



120 



3000. 
2000. 
1000. 
0 



SIB SamDie C. Normal 



1000. 
500. 
0 



HB SamDie C* Tumor 



sooo; 

2000, 
1000 
0 



A 'ft 



HB SamDie D: Normal 



500 



0 



[211 SamDie D: Tumor 
X: 104 Y: 707 



i ► 



FIG. 7 




D1S518 D3S2432 D7S1808 D7S3046 D7S3070 D10S1426 BAT-25 BAT-26 MONO-15 



FIG. 8 



GeneSc^n^ Project-B/25/W Display- 1 i 



110 120 130 140 150 ISO 170 ISO 190 200 210 220 230 

-J I I I 1 I I I I u 



I I I 



1200 

600: 



^1 



UH ib 
□■ iv 



1G 



1200; 
600 2 



BAT-25 

A 



D7S3070 



D7S3046 



J 



1B 



1200 

eoo: 
o : 



MONO-15 

A 



D1S51! 



D7S1808 



II 1G : 



1200j 


BAT-26 




D10S1426 




|D3S2432 


600 i 

0 i 


A 


<-> 


I 


.J 





□ ■ 1Y 



X: 



FIG. 9 



GeneScan™ Pro] eci>8/3 1 /CO Pis pi ay- 3 i 



1m 100 HQ 120 130 140 150 160 170 130 190 200 210 220 £30 240 



2400 J 1 
0 1 



900. 
0 



2400. 
0 



4B : Panel A: Tumor 



MONO-15 

A 



D1S518 



3G : Panel B: Normal 



2400 
1200.3 
0 



A 



40 : Panel B: Tumor 



2900 
1450 
0 



BAT-26 

A 



D10S1426 



D3S2432 




□■ 3Y : Panel C: Normal 



□B 4Y : Panel C: Tumor 



1 ' 1 1— 

BAT-25 

A . 


i 

.h) 


_l 1 1 1 

D7S3070 

* - <"■ ■ — 


i 1 ■ — ■ — 

D7S3046 1 


1 1 

li 


SB 3B : Panel A: Normal 






\ 

A... 1 


UL_ - _— ^- — — a- 


LaJl 






FIG. 10 



GeneScan™ Project- 8/3 1/00 Display-4 ! 



90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 

i i i i i i i i i i i ■ ■ i i i 



1200 
600 j 
0 : 



30D0; 

isooi 



BAT-25 



D7S3070 



D7S3046 



1 1 B : Panel A: Normal 




II ,2B: Panel A: Tumor 



MONO-15 



D1S518 



A 



D7S1808 



11G: Panel B: Normal 




□■ in: Panel C: Normal 




□■ 12Y: Panel C: Tumor 



Y: 



4 ► 



FIG. 1 1 



GeneScan™ Projett-8/31 /QO Display- 1 



1100 110 120 130 140 150 160 170 1S0 190 200 210 220 230 
■ i i i i i i i i i i i i 



1200 
600; 
0. 

200. 
0. 

1600 



1 7B Panel A: Tumor 



400. 
200. 
0 



17G : Panel B: Tumor 



1000 
500 

OA 



BAT-26 




D10S1426 



□■ 1SY: Panel C: Normal 



□H i?Y: Panel C: Tumor 



X:220 Y:317 



D3S2432 




BAT-25 


I 


D7S3070 D7S3046 , 


MM 1SB . Panel A: Normal 




\J 


U.JjU_ _ . J\AaA/JL_ 



] MONO-15 l D1S518, 

^ A I j 

- ^ „... ,„ J V-,., _„ „.._ «>i-J 


i 


D7S1808 

I 


MM 18G : Panel B: Normal 


• -M. J 


I il 


1 JJUui 




i ► 



FIG. 12 



jneScan^ Project- 9/1/00 Display- 3 1 



120 



160 



200 



240 



1000 



500 




1000 



500 



BAT-25 



A 



D7S3070 



aJ 



1H 2B : 



1000 



500 



MONO-15 



D1S518 



1000. 



500 



D7S3046 



A. 



D7S1808 





D10S1426 




D3S2432 


BAT-26 




i 










..A 








^ — 







FIG. 13 



Declaration and Power of Attorney For Patent Application 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to 
my name. 

I believe I am an original, first and joint inventor of the subject matter which is 
claimed and for which a patent is sought on the invention entitled DETECTION OF 
MICROSATELLITE INSTABILITY AND ITS USE IN DIAGNOSING TUMORS 
(Attorney Docket No. 16026-9267), the specification of which is attached hereto. 

I hereby state that I have reviewed and understand the contents of the above 
identified specification, including the claims referred to above. 

I acknowledge the duty to disclose to the Patent and Trademark Office all 
information known to me to be material to patentability as defined in Title 37, Code of 
Federal Regulations, §1.56. 

And I hereby appoint Grady J. Frenchick (Reg. No. 29,018), Christopher B. 
Austin (Reg. No. 41,592), David L De Bruin (Reg. No. 35,489), Gerald L. Fellows 
(Reg. No. 36,133), Randall W. Fieldhack (Reg. No. 43,611), Joseph A. Gemignani 
(Reg. No. 19,482), Gregory J. Hartwig (Reg. No. P-46,761), Daniel S. Jones (Reg. 
No. 42,697), Richard L. Kaiser (Reg. No. 46,158), Timothy M. Kelley (Reg. No. 
34,201), Casimir F. Laska (Reg. No. 30,862), Edward R. Lawson, Jr. (Reg. No. 
41,931), Richard H. Marschall (Reg. No. 39,290), Thomas A. Miller (Reg. No. 
36,871), Kevin P. Moran (Reg. No. 37,193), Andrew R. Peret (Reg. No. 41,246), 
David R. Price (Reg. No. 31 ,557), Thomas S. Reynolds II (Reg. No. 45,262), David 
B. Smith (Reg. No. 27,595), Derek C. Stettner (Reg. No. 37,945), Billie Jean Strandt 
(Reg. No. 36,940), Sheldon L Wolfe (Reg. No. 43,996), Paul F. Donovan (Reg, No. 
39,962), Jill A. Fahrlander (Reg. No. 42,518), Karen B. King (Reg. No. 41,898), 
Linda Blair Meier (Reg. No. 39,769), Teresa J. Welch (Reg. No. 33,049), Robert S. 
Beiser (Reg. No. 28,687), Witold A. Ziarno (Reg. No. 39,888), and each or any of 
them, my attorneys or agents, with full power of substitution and revocation, to 
prosecute this application and to transact all business in the Patent and Trademark 
Office connected therewith. 

ADDRESS ALL COMMUNICATIONS IN OR PERTAINING TO THIS 
APPLICATION TO: 

Grady J. Frenchick 
Michael Best & Friedrich LLP 
One South Pinckney Street 
P. O. Box 1806 
Madison, Wl 53701-1806 



I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information and belief are believed to be true; 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful false 
statements may jeopardize the validity of the application or any patent issued 
thereon. 



Full name of first joint inventor: Jeffrey W. Bacher 

Inventor's signature ___ 

Date: 

Residence: US 
Citizenship: US 

Post Office Address: 2633 Chamberlain Avenue 

Madison, Wl 53705 



Full name of second joint inventor: Laura Hennes 

Inventor's signature 

Date: . 

Residence: US 
Citizenship: US 
Post Office Address: 210 S. High Street 

Fort Atkinson, Wl 53538 



Full name of third joint inventor: Nadine Nassif 



Inventor's signature _ 
Date: 

Residence: 
Citizenship: 
Post Office Address: 



US 
US 

2601 Whitlock Road 
Madison, Wl 53719 



SEQUENCE LISTING 



<110> Bacher, Jeffery W. 
Hennes, Laura 
Nassif, Nadine 

<12 0> DETECTION OF MICRO SATELLITE INSTABILITY AND ITS USE IN 
THE DIAGNOSING TUMORS 

<130> 16026-9267 

<14 0> Unknown 
<141> 2000-09-15 

<160> 62 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> BAT-25 primer 
<400> 1 

tcgcctccaa gaatgtaagt 20 



<210> 2 

<211> 21 

<212> DNA 

<213> Homo sapiens 
<220> 

<223> BAT-25 primer 

<400> 2 

tctgcatttt aactatggct c 21 



<210> 3 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<220> 



1 



<223> BAT-26 primer 
<400> 3 

tgactacttt tgacttcagc c 21 



<210> 4 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> BAT-26 primer 

<400> 4 

aaccattcaa catttttaac cc 



<210> 5 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> MONO- 11 primer 
<400> 5 

gagctgtgat tgcactacac 



<210> 6 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> MONO- 11 primer 
<400> 6 

ggcatgaatt actactgtcc tact 24 



<210> 7 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 



2 



<223> MONO-15 primer 
<400> 7 

tcagatttat tttgggcttc actc 



<210> 8 
<211> 18 
<212> DNA 
<213> Homo 

<220> 

<223> MONO 

<400> 8 
ggcggagctt 



sapiens 



15 primer 



gcagtgag 



<210> 9 

<211> 20 

<212> DNA 

<213> Homo sapiens 
<220> 

<223> D1S518 primer 

<400> 9 

tgcagatctt gggacttctc 



<210> 10 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D1S518 primer 

<400> 10 

aaaaagagtg tgggcaactg 



<210> 11 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<223> D1S547 primer 
<400> 11 

ctgaagtggg aggattgctt 

<210> 12 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D1S547 primer 
<400> 12 

aattcagggg agttccagag 

<210> 13 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D1S1677 primer 
<400> 13 

agtcagcttg attgacccag 

<210> 14 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D1S1677 primer 
<400> 14 

cttagtgtga caggaaggac g 

<210> 15 
<211> 20 
<212> DNA 

<213> Homo sapiens 



<220> 



<223> D2S1790 primer 



<400> 15 

acatgtcgat ctcagcgttc 

<210> 16 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D2S1790 primer 
<400> 16 

gagttttatt ggccaaagca 

<210> 17 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D3S2432 primer 
<400> 17 

ggcaggcagg tagatagaca 

<210> 18 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D3S2432 primer 
<400> 18 

acactaaaca agcatagtca ggc 



<210> 19 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

5 



<223> D5S818 primer 
<400> 19 

gggtgatttt cctctttggt 

<210> 20 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D5S818 primer 
<400> 20 

tgattccaat catagccaca 

<210> 21 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> D5S2849 primer 
<400> 21 

cctggaagaa ccaatgctta 

<210> 22 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D5S2849 primer 
<400> 22 

ttgagcccag aaagtttgag 

<210> 23 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<223> D6S1053 primer 
<400> 23 

tatttcaaag gcagcaaagc 

<210> 24 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D6S1053 primer 
<400> 24 

gcttgcagac agcctattgt 

<210> 25 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D7S1808 primer 
<400> 25 

cagaacaaac aaatggggag 



<210> 


26 


<211> 


20 


<212> 


DNA 


<213> 


Homo 



sapiens 



<220> 

<223> D7S1808 primer 
<400> 26 

ccaaataaga ctcaggacgc 



<210> 


27 


<211> 


20 


<212> 


DNA 


<213> 


Homo 



sapiens 



<223> D7S3046 primer 
<400> 27 

acatacggat gaatggatgg 

<210> 28 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D7S3046 primer 
<400> 28 

tataacctct ctccctatct 

<210> 29 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D7S3070 primer 
<400> 29 

cccccatgag ttattcctct 



<210> 30 

<211> 20 

<212> DNA 

<213> Homo sapiens 
<220> 

<223> D7S3070 primer 

<400> 30 

ggaagccaaa tgttgaattg 

<210> 31 

<211> 25 

<212> DNA 

<213> Homo sapiens 



<220> 



<223> D8S1179 primer 



<400> 31 

tttttgtatt tcatgtgtac attcg 

<210> 32 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D8S1179 primer 
<400> 32 

cgtagctata attagttcat tttca 

<210> 33 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D9S2169 primer 
<400> 33 

ttcccaaaag ttgccatcta 

<210> 34 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D9S2169 primer 
<400> 34 

agcccaaaat gttatgcaag 

<210> 35 

<2ll> 20 

<212> DNA 

<213> Homo sapiens 



<223> D10S1426 primer 
<400> 35 

ttggtggtgt catcctcttt 



<210> 36 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<220> 

<223> D10S1426 primer 
<400> 36 

ctcttaactg atttggccga 



<210> 37 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D10S2470 primer 

<400> 37 

cctcctagct cctcaagctt 



<210> 38 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D10S2470 primer 
<400> 38 

caggacagat ttcctgtggt 



<210> 39 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<223> D12S391 primer 



<400> 39 

aacaggatca atggatgcat 

<210> 40 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D12S391 primer 
<400> 40 

tggcttttag acctggactg 

<210> 41 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D17S1294 primer 
<400> 41 

tggcatgcaa ttgtagtctc 



<210> 42 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> D17S1294 primer 
<400> 42 

ttctttcctt actaagttga gaacg 



<210> 43 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<223> D17S1299 primer 



<400> 43 

tagcacttga gcacacatgg 



<210> 44 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> D17S1299 primer 
<400> 44 

gtgcattatg gggaccatta 



<210> 45 

<211> 20 

<212> DMA 

<213> Homo sapiens 

<220> 

<223> D18S51 primer 
<400> 45 

gagccatgtt catgccactg 



<210> 


46 


<211> 


20 


<212> 


DNA 


<213> 


Homo 



sapiens 



<220> 

<223> D18S51 primer 
<400> 46 

caaacccgac taccagcaac 



<210> 


47 


<211> 


21 


<212> 


DNA 


<213> 


Homo 



sapiens 



<223> FGA primer 
<400> 47 

ccataggttt tgaactcaca g 21 



<210> 


48 


<211> 


20 


<212> 


DNA 


<213> 


Homo 



sapiens 



<220> 

<223> FGA primer 
<400> 48 

cttctcagat cctctgacac 20 



<210> 


49 


<211> 


25 


<212> 


DNA 


<213> 


Homo 



sapiens 



<220> 

<223> D1S518 primer 
<400> 49 

gtcaattcct tgttataaaa ttata 25 



<210> 50 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D1S518 primer 
<400> 50 

attggcaact gcattagagt tctc 24 



<210> 51 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 



13 



<223> D7S1808 primer 
<400> 51 

ggaggaaaag tcttaaacgt gaat 

<210> 52 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D7S1808 primer 
<400> 52 

attggccttg atgtgtttgt tact 



<210> 53 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D7S3070 primer 

<400> 53 

catttcttct gcccccatga 



<210> 54 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> D7S3070 primer 
<400> 54 

atttgacagc tgaaaaggtg cagatg 

<210> 55 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<220> 



<223> D7S3046 primer 
<400> 55 

gaggagacag ccagggatat a 

<210> 56 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D7S3046 primer 
<400> 56 

atttctctat aacctctctc cctatct 



<210> 57 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D10S1426 primer 

<400> 57 

ccccttggtg gtgtcatcct 



<210> 58 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> D10S1426 primer 
<400> 58 

attgccgatc ctgaagcaat age 

<210> 59 

<211> 24 

<212> DNA 

<213> Homo sapiens 



<220> 



<223> D3S2432 primer 



<400> 59 

attgtttgca tgtgaaacag gtca 



<210> 60 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> BAT- 2 5 primer 
<400> 60 

attctgcatt ttaactatgg ctct 



<210> 61 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> BAT-26 primer 
<400> 61 

tgactacttt tgacttcagc cagt 



<210> 62 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> BAT-26 primer 
<400> 62 

aaccaatcaa catttttaac cctt 



